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Deoxyribonucleic acid (DNA) is a nucleic acid that 
contains the genetic instructions used in the 
development and functioning of all known living 
organisms and some viruses. The main role of DNA 
molecules is the long-term storage of information. 
DNA is often compared to a set of blueprints or a 
recipe, or a code, since it contains the instructions 
needed to construct other components of cells, such 
as proteins and RNA molecules. The DNA segments 
that carry this genetic information are called genes, 
but other DNA sequences have structural purposes, 
or are involved in regulating the use of this genetic 
information. 

Chemically, DNA consists of two long polymers of 

simple units called nucleotides, with backbones made 

of sugars and phosphate groups joined by ester 

bonds. These two strands run in opposite directions 

to each other and are therefore anti-parallel. 

Attached to each sugar is one of four types of 

molecules called bases. It is the sequence of these 

four bases along the backbone that encodes 

information. This information is read using the 

genetic code, which specifies the sequence of the 

amino acids within proteins. The code is read by copying stretches of DNA into the related 

nucleic acid RNA, in a process called transcription. 

Within cells, DNA is organized into X-shaped structures called chromosomes. These 
chromosomes are duplicated before cells divide, in a process called DNA replication. 
Eukaryotic organisms (animals, plants, fungi, and protists) store most of their DNA inside 
the cell nucleus and some of their DNA in the mitochondria (animals and plants) and 
chloroplasts (plants only) L J . Prokaryotes (bacteria and archaea) however, store their DNA 
in the cell's cytoplasm. Within the chromosomes, chromatin proteins such as histones 
compact and organize DNA. These compact structures guide the interactions between DNA 
and other proteins, helping control which parts of the DNA are transcribed. 




The structure of part of a DNA double 
helix 
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Properties 




Strands of purified DNA 
precipitated from solutions of 
cell components are visible as 

viscous white substance. 



DNA is a long polymer made from repeating units called 
nucleotides. ] [ ] [ ] These nucleotides are adenine (A), 
guanine (G), cytosine (C) and thymine (T). In the related 
nucleic acid RNA, thymine is replaced by uracil (U). These 
nucleotides can be classified into two groups: purines 
(adenine and guanine) and pyrimidines (thymine and 
cytosine). 

The DNA chain is 22 to 26 Angstroms wide (2.2 to 
2.6 nanometres), and one nucleotide unit is 3.3 A (0.33 nm) 
long. ] Although each individual repeating unit is very small, 
DNA polymers can be very large molecules containing 
millions of nucleotides. For instance, the largest human 
chromosome, chromosome number 1, is approximately 220 
million base pairs long. 6] 



In living organisms, DNA does 
not usually exist as a single 
molecule, but instead as a pair 
of molecules that are held 
tightly together. [7] [8] These two 
long strands entwine like vines, 
in the shape of a double helix. 
The nucleotide repeats contain 
both the segment of the 
backbone of the molecule, 
which holds the chain together, 
and a base, which interacts with 
the other DNA strand in the 
helix. In general, a base linked 
to a sugar is called a nucleoside 
and a base linked to a sugar and 
one or more phosphate groups 
is called a nucleotide. If 
multiple nucleotides are linked 
together, as in DNA, this 
polymer is called a 

polynucleotide. ] 



Thymine 



3 1 end 

OH 




Phosphate- o v h 2 n 

deoxyribose°7\ >-{ )-VS * 

■ ■■ ( NH N \ r 



backbone 





H2N 






3-°end Cytosine o /^° 

Guanine send 



The chemical structure of DNA. Hydrogen bonds are shown as 
dotted lines. 



The backbone of the DNA 
strand is made from alternating 
phosphate and sugar 

residues. J The sugar in DNA is 2-deoxyribose, which is a pentose (five-carbon) sugar. The 
sugars are joined together by phosphate groups that form phosphodiester bonds between 
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the third and fifth carbon atoms of adjacent sugar rings. These asymmetric bonds mean a 
strand of DNA has a direction. In a double helix the direction of the nucleotides in one 
strand is opposite to their direction in the other strand. This arrangement of DNA strands is 
called antiparallel. The asymmetric ends of DNA strands are referred to as the 5Q (five 
prime) and 3D (three prime) ends, with the 5' end being that with a terminal phosphate 
group and the 3 1 end that with a terminal hydroxyl group. One of the major differences 
between DNA and RNA is the sugar, with 2-deoxyribose being replaced by the alternative 
pentose sugar ribose in RNA. J 

The DNA double helix is stabilized by hydrogen bonds between the bases attached to the 
two strands. The four bases found in DNA are adenine (abbreviated A), cytosine (C), 
guanine (G) and thymine (T). These four bases are attached to the sugar/phosphate to form 
the complete nucleotide, as shown for adenosine monophosphate. 

These bases are classified into two types; adenine and guanine are fused five- and 
six-membered heterocyclic compounds called purines, while cytosine and thymine are 
six-membered rings called pyrimidines. 8] A fifth pyrimidine base, called uracil (U), usually 
takes the place of thymine in RNA and differs from thymine by lacking a methyl group on its 
ring. Uracil is not usually found in DNA, occurring only as a breakdown product of cytosine. 



Grooves 

Twin helical strands form the DNA backbone. Another 
double helix may be found by tracing the spaces, or 
grooves, between the strands. These voids are adjacent 
to the base pairs and may provide a binding site. As the 
strands are not directly opposite each other, the 
grooves are unequally sized. One groove, the major 
groove, is 22 A wide and the other, the minor groove, is 
12 A wide. J The narrowness of the minor groove 
means that the edges of the bases are more accessible 
in the major groove. As a result, proteins like 
transcription factors that can bind to specific sequences 
in double-stranded DNA usually make contacts to the 
sides of the bases exposed in the major groove. ] This 
situation varies in unusual conformations of DNA within 
the cell (see below), but the major and minor grooves 
are always named to reflect the differences in size that 
would be seen if the DNA is twisted back into the 
ordinary B form. 

Base pairing 




Structure of a section of DNA. The 
bases lie horizontally between the two 

spiraling strands. Animated 

version at File: DNA orbit animated.gif 

- over 3 megabytes. 



Each type of base on one strand forms a bond with just 
one type of base on the other strand. This is called 
complementary base pairing. Here, purines form 
hydrogen bonds to pyrimidines, with A bonding only to T, and C bonding only to G. This 

arrangement of two nucleotides binding together across the double helix is called a base 
pair. As hydrogen bonds are not covalent, they can be broken and rejoined relatively easily. 
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The two strands of DNA in a double helix can therefore be pulled apart like a zipper, either 
by a mechanical force or high temperature. J As a result of this complementarity, all the 
information in the double-stranded sequence of a DNA helix is duplicated on each strand, 
which is vital in DNA replication. Indeed, this reversible and specific interaction between 
complementary base pairs is critical for all the functions of DNA in living organisms. 3] 
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Top, a GC base pair with three hydrogen bonds. Bottom, an AT base pair with two 
hydrogen bonds. Non-covalent hydrogen bonds between the pairs are shown as dashed 
lines. 

The two types of base pairs form different numbers of hydrogen bonds, AT forming two 
hydrogen bonds, and GC forming three hydrogen bonds (see figures, left). DNA with high 
GC-content is more stable than DNA with low GC-content, but contrary to popular belief, 
this is not due to the extra hydrogen bond of a GC basepair but rather the contribution of 
stacking interactions (hydrogen bonding merely provides specificity of the pairing, not 
stability). ] As a result, it is both the percentage of GC base pairs and the overall length of 
a DNA double helix that determine the strength of the association between the two strands 
of DNA. Long DNA helices with a high GC content have stronger-interacting strands, while 
short helices with high AT content have weaker-interacting strands. ] In biology, parts of 
the DNA double helix that need to separate easily, such as the TATAAT Pribnow box in 
some promoters, tend to have a high AT content, making the strands easier to pull apart. 7] 
In the laboratory, the strength of this interaction can be measured by finding the 
temperature required to break the hydrogen bonds, their melting temperature (also called 
T value). When all the base pairs in a DNA double helix melt, the strands separate and 
exist in solution as two entirely independent molecules. These single-stranded DNA 
molecules have no single common shape, but some conformations are more stable than 
others. [18] 
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Sense and antisense 

A DNA sequence is called "sense" if its sequence is the same as that of a messenger RNA 
copy that is translated into protein. ] The sequence on the opposite strand is called the 
"antisense" sequence. Both sense and antisense sequences can exist on different parts of 
the same strand of DNA (i.e. both strands contain both sense and antisense sequences). In 
both prokaryotes and eukaryotes, antisense RNA sequences are produced, but the functions 
of these RNAs are not entirely clear. ] One proposal is that antisense RNAs are involved in 
regulating gene expression through RNA- RNA base pairing. J 

A few DNA sequences in prokaryotes and eukaryotes, and more in plasmids and viruses, 
blur the distinction between sense and antisense strands by having overlapping genes. ] 
In these cases, some DNA sequences do double duty, encoding one protein when read along 
one strand, and a second protein when read in the opposite direction along the other 
strand. In bacteria, this overlap may be involved in the regulation of gene transcription, J 
while in viruses, overlapping genes increase the amount of information that can be encoded 
within the small viral genome. J 

Supercoiling 

DNA can be twisted like a rope in a process called DNA supercoiling. With DNA in its 
"relaxed" state, a strand usually circles the axis of the double helix once every 10.4 base 
pairs, but if the DNA is twisted the strands become more tightly or more loosely wound. J 
If the DNA is twisted in the direction of the helix, this is positive supercoiling, and the bases 
are held more tightly together. If they are twisted in the opposite direction, this is negative 
supercoiling, and the bases come apart more easily. In nature, most DNA has slight 
negative supercoiling that is introduced by enzymes called topoisomerases. ] These 
enzymes are also needed to relieve the twisting stresses introduced into DNA strands 
during processes such as transcription and DNA replication. J 



Alternate DNA structures 

DNA exists in many possible 
conformations that include A-DNA, 
B-DNA, and Z-DNA forms, although, 
only B-DNA and Z-DNA have been 
directly observed in functional 
organisms. J The conformation 
that DNA adopts depends on the 
hydration level, DNA sequence, the 
amount and direction of 

supercoiling, chemical modifications 
of the bases, the type and 
concentration of metal ions, as well 
as the presence of polyamines in solution. ] 






From left to right, the structures of A, B and Z DNA 



The first published reports of A-DNA X-ray diffraction patterns— and also B-DNA used 
analyses based on Patterson transforms that provided only a limited amount of structural 
information for oriented fibers of DNA. ] [ ] An alternate analysis was then proposed by 
Wilkins et ah, in 1953, for the in vivo B-DNA X-ray diffraction/scattering patterns of highly 



DNA 



hydrated DNA fibers in terms of squares of Bessel functions. 31] In the same journal, 
Watson and Crick presented their molecular modeling analysis of the DNA X-ray diffraction 
patterns to suggest that the structure was a double-helix.- 1 

Although the B-DNA form' is most common under the conditions found in cells, 32] it is not 
a well-defined conformation but a family of related DNA conformations^ J that occur at the 
high hydration levels present in living cells. Their corresponding X-ray diffraction and 
scattering patterns are characteristic of molecular paracrystals with a significant degree of 
disorder. [34] [35] 

Compared to B-DNA, the A-DNA form is a wider right-handed spiral, with a shallow, wide 
minor groove and a narrower, deeper major groove. The A form occurs under 
non-physiological conditions in partially dehydrated samples of DNA, while in the cell it 
may be produced in hybrid pairings of DNA and RNA strands, as well as in enzyme-DNA 
complexes. 36] [37 - 1 Segments of DNA where the bases have been chemically modified by 
methylation may undergo a larger change in conformation and adopt the Z form. Here, the 
strands turn about the helical axis in a left-handed spiral, the opposite of the more common 
B form. J These unusual structures can be recognized by specific Z-DNA binding proteins 
and may be involved in the regulation of transcription. 39] 



Quadruplex structures 




Structure of a DNA quadruplex formed by telomere repeats. The 
looped conformation of the DNA backbone is very different from 
the typical helical structure. 



At the ends of the linear 
chromosomes are specialized 
regions of DNA called telomeres. 
The main function of these regions 
is to allow the cell to replicate 
chromosome ends using the 
enzyme telomerase, as the 
enzymes that normally replicate 
DNA cannot copy the extreme 3D 
ends of chromosomes. J These 
specialized chromosome caps also 
help protect the DNA ends, and 
stop the DNA repair systems in the 
cell from treating them as damage 
to be corrected. ] In human cells, 
telomeres are usually lengths of 
single-stranded DNA containing 
several thousand repeats of a 



simple TTAGGG sequence 



[43] 



These guanine-rich sequences may 
stabilize chromosome ends by forming structures of stacked sets of four-base units, rather 
than the usual base pairs found in other DNA molecules. Here, four guanine bases form a 
flat plate and these flat four-base units then stack on top of each other, to form a stable 
G- quadruplex structure. ] These structures are stabilized by hydrogen bonding between 
the edges of the bases and chelation of a metal ion in the centre of each four-base unit. 5] 
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Other structures can also be formed, with the central set of four bases coming from either a 
single strand folded around the bases, or several different parallel strands, each 
contributing one base to the central structure. 

In addition to these stacked structures, telomeres also form large loop structures called 
telomere loops, or T-loops. Here, the single-stranded DNA curls around in a long circle 
stabilized by telomere-binding proteins. 6] At the very end of the T-loop, the 
single-stranded telomere DNA is held onto a region of double-stranded DNA by the 
telomere strand disrupting the double-helical DNA and base pairing to one of the two 
strands. This triple-stranded structure is called a displacement loop or D-loop. ] 



Branched DNA 





A DNA structure with multiple branches. 



In DNA fraying occurs when non-complementary regions exist at the end of an otherwise 
complementary double-strand of DNA. However, branched DNA can occur if a third strand 
of DNA is introduced and contains adjoining regions able to hybridize with the frayed 
regions of the pre-existing double-strand. Although the simplest example of branched DNA 
involves only three strands of DNA, complexes involving additional strands and multiple 
branches are also possible. J 



Chemical modifications 
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Structure of cytosine with and without the 5-methyl group. After deamination the 5-methylcytosine 
has the same structure as thymine 

Base modifications 

The expression of genes is influenced by how the DNA is packaged in chromosomes, in a 
structure called chromatin. Base modifications can be involved in packaging, with regions 
that have low or no gene expression usually containing high levels of methylation of 
cytosine bases. For example, cytosine methylation, produces 5-methylcytosine, which is 
important for X-chromosome inactivation. ] The average level of methylation varies 
between organisms - the worm Caenorhabditis elegans lacks cytosine methylation, while 
vertebrates have higher levels, with up to 1% of their DNA containing 5-methylcytosine. ] 
Despite the importance of 5-methylcytosine, it can deaminate to leave a thymine base, 
methylated cytosines are therefore particularly prone to mutations. ] Other base 
modifications include adenine methylation in bacteria, the presence of 
5-hydroxymethylcytosine in the brain, ] and the glycosylation of uracil to produce the 
"J-base" in kinetoplastids. [52] [53] 



Damage 

DNA can be damaged by many different 
sorts of mutagens, which change the DNA 
sequence. Mutagens include oxidizing 
agents, alkylating agents and also 
high-energy electromagnetic radiation such 
as ultraviolet light and X-rays. The type of 
DNA damage produced depends on the 
type of mutagen. For example, UV light can 
damage DNA by producing thymine dimers, 
which are cross-links between pyrimidine 
bases. 5] On the other hand, oxidants such 
as free radicals or hydrogen peroxide 
produce multiple forms of damage, 
including base modifications, particularly of 
guanosine, and double-strand breaks. ] A 
typical human cell contains about 150,000 
bases that have suffered oxidative 
damage. J Of these oxidative lesions, the 
most dangerous are double-strand breaks, 
as these are difficult to repair and can 
produce point mutations, insertions and 
deletions from the DNA sequence, as well 
as chromosomal translocations. ] 




A covalent adduct between benzo[a]pyrene, the major 
mutagen in tobacco smoke, and DNA 



Many mutagens fit into the space between two adjacent base pairs, this is called 
intercalating. Most intercalators are aromatic and planar molecules, and include Ethidium 
bromide, daunomycin, and doxorubicin. In order for an intercalator to fit between base 
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pairs, the bases must separate, distorting the DNA strands by unwinding of the double 
helix. This inhibits both transcription and DNA replication, causing toxicity and mutations. 
As a result, DNA intercalators are often carcinogens, and Benzo[a]pyrene diol epoxide, 
acridines, aflatoxin and ethidium bromide are well-known examples. J L J L J 
Nevertheless, due to their ability to inhibit DNA transcription and replication, other similar 
toxins are also used in chemotherapy to inhibit rapidly growing cancer cells. J 

Biological functions 

DNA usually occurs as linear chromosomes in eukaryotes, and circular chromosomes in 
prokaryotes. The set of chromosomes in a cell makes up its genome; the human genome has 
approximately 3 billion base pairs of DNA arranged into 46 chromosomes. J The 
information carried by DNA is held in the sequence of pieces of DNA called genes. 
Transmission of genetic information in genes is achieved via complementary base pairing. 
For example, in transcription, when a cell uses the information in a gene, the DNA 
sequence is copied into a complementary RNA sequence through the attraction between 
the DNA and the correct RNA nucleotides. Usually, this RNA copy is then used to make a 
matching protein sequence in a process called translation which depends on the same 
interaction between RNA nucleotides. Alternatively, a cell may simply copy its genetic 
information in a process called DNA replication. The details of these functions are covered 
in other articles; here we focus on the interactions between DNA and other molecules that 
mediate the function of the genome. 

Genes and genomes 

Genomic DNA is located in the cell nucleus of eukaryotes, as well as small amounts in 
mitochondria and chloroplasts. In prokaryotes, the DNA is held within an irregularly shaped 
body in the cytoplasm called the nucleoid. J The genetic information in a genome is held 
within genes, and the complete set of this information in an organism is called its genotype. 
A gene is a unit of heredity and is a region of DNA that influences a particular 
characteristic in an organism. Genes contain an open reading frame that can be 
transcribed, as well as regulatory sequences such as promoters and enhancers, which 
control the transcription of the open reading frame. 

In many species, only a small fraction of the total sequence of the genome encodes protein. 
For example, only about 1.5% of the human genome consists of protein-coding exons, with 
over 50% of human DNA consisting of non-coding repetitive sequences. 65] The reasons for 
the presence of so much non-coding DNA in eukaryotic genomes and the extraordinary 
differences in genome size, or C-value, among species represent a long-standing puzzle 
known as the "C-value enigma. " L J However, DNA sequences that do not code protein may 
still encode functional non-coding RNA molecules, which are involved in the regulation of 
gene expression. 7] 
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T7 RNA polymerase (blue) producing a mRNA (green) from a 
DNA template (orange). 



and divergence. J 



Some non-coding DNA sequences 
play structural roles in 
chromosomes. Telomeres and 
centromeres typically contain few 
genes, but are important for the 
function and stability of 
chromosomes. J L J An abundant 
form of non-coding DNA in humans 
are pseudogenes, which are copies 
of genes that have been disabled 
by mutation. ] These sequences 
are usually just molecular fossils, 
although they can occasionally 
serve as raw genetic material for 
the creation of new genes through 
the process of gene duplication 



Transcription and translation 

A gene is a sequence of DNA that contains genetic information and can influence the 
phenotype of an organism. Within a gene, the sequence of bases along a DNA strand 
defines a messenger RNA sequence, which then defines one or more protein sequences. 
The relationship between the nucleotide sequences of genes and the amino-acid sequences 
of proteins is determined by the rules of translation, known collectively as the genetic code. 
The genetic code consists of three-letter 'words' called codons formed from a sequence of 
three nucleotides (e.g. ACT, CAG, TTT). 

In transcription, the codons of a gene are copied into messenger RNA by RNA polymerase. 
This RNA copy is then decoded by a ribosome that reads the RNA sequence by base-pairing 
the messenger RNA to transfer RNA, which carries amino acids. Since there are 4 bases in 
3-letter combinations, there are 64 possible codons ( 4 J combinations). These encode the 
twenty standard amino acids, giving most amino acids more than one possible codon. There 
are also three 'stop' or 'nonsense' codons signifying the end of the coding region; these are 
the TAA, TGA and TAG codons. 
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DNA Polymerase (Pola) 



Lagging 
strand 




Okazaki fragment 



Leading 
strand 




DNA Polymerase fPolSt f A 



Topoisomerase 



DNA Polymerase {Pol5) 

Hellcase' 



Single strand, 

Binding proteins 

DNA replication. The double helix is unwound by a helicase and 

topoisomerase. Next, one DNA polymerase produces the leading 

strand copy. Another DNA polymerase binds to the lagging strand. 

This enzyme makes discontinuous segments (called Okazaki 

fragments) before DNA ligase joins them together. 



Replication 

Cell division is essential for an 

organism to grow, but when a 

cell divides it must replicate 

the DNA in its genome so that 

the two daughter cells have 

the same genetic information 

as their parent. The 

double-stranded structure of 

DNA provides a simple 

mechanism for DNA 

replication. Here, the two 

strands are separated and 

then each strand's 

complementary DNA sequence 

is recreated by an enzyme called DNA polymerase. This enzyme makes the complementary 

strand by finding the correct base through complementary base pairing, and bonding it 

onto the original strand. As DNA polymerases can only extend a DNA strand in a 5Q to 3D 

direction, different mechanisms are used to copy the antiparallel strands of the double 

helix. ] In this way, the base on the old strand dictates which base appears on the new 

strand, and the cell ends up with a perfect copy of its DNA. 

Interactions with proteins 

All the functions of DNA depend on interactions with proteins. These protein interactions 
can be non-specific, or the protein can bind specifically to a single DNA sequence. Enzymes 
can also bind to DNA and of these, the polymerases that copy the DNA base sequence in 
transcription and DNA replication are particularly important. 

DNA-binding proteins 
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Interaction of DNA with histones (shown in white, top). These proteins' basic amino acids 
(below left, blue) bind to the acidic phosphate groups on DNA (below right, red). 

Structural proteins that bind DNA are well-understood examples of non-specific 
DNA-protein interactions. Within chromosomes, DNA is held in complexes with structural 
proteins. These proteins organize the DNA into a compact structure called chromatin. In 
eukaryotes this structure involves DNA binding to a complex of small basic proteins called 
histones, while in prokaryotes multiple types of proteins are involved. J L J The histones 
form a disk-shaped complex called a nucleosome, which contains two complete turns of 
double-stranded DNA wrapped around its surface. These non-specific interactions are 
formed through basic residues in the histones making ionic bonds to the acidic 
sugar-phosphate backbone of the DNA, and are therefore largely independent of the base 
sequence. 75] Chemical modifications of these basic amino acid residues include 
methylation, phosphorylation and acetylation. J These chemical changes alter the strength 
of the interaction between the DNA and the histones, making the DNA more or less 
accessible to transcription factors and changing the rate of transcription. J Other 
non-specific DNA-binding proteins in chromatin include the high-mobility group proteins, 
which bind to bent or distorted DNA. J These proteins are important in bending arrays of 
nucleosomes and arranging them into the larger structures that make up chromosomes. 79] 

A distinct group of DNA-binding proteins are the DNA-binding proteins that specifically 
bind single-stranded DNA. In humans, replication protein A is the best-understood member 
of this family and is used in processes where the double helix is separated, including DNA 
replication, recombination and DNA repair. J These binding proteins seem to stabilize 
single-stranded DNA and protect it from forming stem-loops or being degraded by 
nucleases. 

In contrast, other proteins have evolved to bind to 
particular DNA sequences. The most intensively 
studied of these are the various transcription factors, 
which are proteins that regulate transcription. Each 
transcription factor binds to one particular set of DNA 
sequences and activates or inhibits the transcription of 
genes that have these sequences close to their 
promoters. The transcription factors do this in two 
ways. Firstly, they can bind the RNA polymerase 
responsible for transcription, either directly or through 
other mediator proteins; this locates the polymerase at 
the promoter and allows it to begin transcription. 82] 
Alternatively, transcription factors can bind enzymes 
that modify the histones at the promoter; this will 
change the accessibility of the DNA template to the 
polymerase. 83] 




As these DNA targets can occur throughout an 
organism's genome, changes in the activity of one type 
of transcription factor can affect thousands of 
genes. ] Consequently, these proteins are often the 



The lambda repressor helix-turn-helix 
transcription factor bound to its DNA 
target 



targets of the signal transduction processes that control responses to environmental 
changes or cellular differentiation and development. The specificity of these transcription 
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factors' interactions with DNA come from the proteins making multiple contacts to the 
edges of the DNA bases, allowing them to "read" the DNA sequence. Most of these 



base-interactions are made in the major groove, where the bases are most accessible 



[85] 




The restriction enzyme EcoRV (green) in a complex 



with its substrate DNA 



DNA-modifying enzymes 

Nucleases and ligases 

Nucleases are enzymes that cut DNA 
strands by catalyzing the hydrolysis of the 
phosphodiester bonds. Nucleases that 
hydrolyse nucleotides from the ends of 
DNA strands are called exonucleases, 
while endonucleases cut within strands. 
The most frequently used nucleases in 
molecular biology are the restriction 
endonucleases, which cut DNA at specific 
sequences. For instance, the EcoRV 
enzyme shown to the left recognizes the 
6-base sequence 5Q-GAT|ATC-3Q and makes a cut at the vertical line. In nature, these 
enzymes protect bacteria against phage infection by digesting the phage DNA when it 
enters the bacterial cell, acting as part of the restriction modification system. 7] In 
technology, these sequence-specific nucleases are used in molecular cloning and DNA 
fingerprinting. 

Enzymes called DNA ligases can rejoin cut or broken DNA strands. ] Ligases are 
particularly important in lagging strand DNA replication, as they join together the short 
segments of DNA produced at the replication fork into a complete copy of the DNA 
template. They are also used in DNA repair and genetic recombination. J 

Topoisomerases and helicases 

Topoisomerases are enzymes with both nuclease and ligase activity. These proteins change 
the amount of supercoiling in DNA. Some of these enzyme work by cutting the DNA helix 
and allowing one section to rotate, thereby reducing its level of supercoiling; the enzyme 
then seals the DNA break. J Other types of these enzymes are capable of cutting one DNA 
helix and then passing a second strand of DNA through this break, before rejoining the 
helix. J Topoisomerases are required for many processes involving DNA, such as DNA 
replication and transcription. 7] 

Helicases are proteins that are a type of molecular motor. They use the chemical energy in 
nucleoside triphosphates, predominantly ATP, to break hydrogen bonds between bases and 
unwind the DNA double helix into single strands. ] These enzymes are essential for most 
processes where enzymes need to access the DNA bases. 
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Polymerases 

Polymerases are enzymes that synthesize polynucleotide chains from nucleoside 
triphosphates. The sequence of their products are copies of existing polynucleotide chains - 
which are called templates. These enzymes function by adding nucleotides onto the 3D 
hydroxyl group of the previous nucleotide in a DNA strand. Consequently, all polymerases 
work in a 5Q to 3D direction. 91] In the active site of these enzymes, the incoming nucleoside 
triphosphate base-pairs to the template: this allows polymerases to accurately synthesize 
the complementary strand of their template. Polymerases are classified according to the 
type of template that they use. 

In DNA replication, a DNA-dependent DNA polymerase makes a copy of a DNA sequence. 
Accuracy is vital in this process, so many of these polymerases have a proofreading activity. 
Here, the polymerase recognizes the occasional mistakes in the synthesis reaction by the 
lack of base pairing between the mismatched nucleotides. If a mismatch is detected, a 3D to 
5D exonuclease activity is activated and the incorrect base removed. J In most organisms 
DNA polymerases function in a large complex called the replisome that contains multiple 
accessory subunits, such as the DNA clamp or helicases. J 

RNA-dependent DNA polymerases are a specialized class of polymerases that copy the 
sequence of an RNA strand into DNA. They include reverse transcriptase, which is a viral 
enzyme involved in the infection of cells by retroviruses, and telomerase, which is required 
for the replication of telomeres. 1] [94] Telomerase is an unusual polymerase because it 
contains its own RNA template as part of its structure. J 

Transcription is carried out by a DNA-dependent RNA polymerase that copies the sequence 
of a DNA strand into RNA. To begin transcribing a gene, the RNA polymerase binds to a 
sequence of DNA called a promoter and separates the DNA strands. It then copies the gene 
sequence into a messenger RNA transcript until it reaches a region of DNA called the 
terminator, where it halts and detaches from the DNA. As with human DNA-dependent DNA 
polymerases, RNA polymerase II, the enzyme that transcribes most of the genes in the 
human genome, operates as part of a large protein complex with multiple regulatory and 
accessory subunits. 95] 

Genetic recombination 
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Structure of the Holliday junction intermediate in genetic recombination. The four separate 
DNA strands are coloured red, blue, green and yellow. 96] 

A DNA helix usually does not interact with 
other segments of DNA, and in human cells 
the different chromosomes even occupy 
separate areas in the nucleus called 
"chromosome territories". 7 - 1 This physical 
separation of different chromosomes is 
important for the ability of DNA to function 
as a stable repository for information, as 
one of the few times chromosomes interact 
is during chromosomal crossover when 
they recombine. Chromosomal crossover is 
when two DNA helices break, swap a 
section and then rejoin. 
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Recombination involves the breakage and rejoining of 

two chromosomes (M and F) to produce two 

re-arranged chromosomes (CI and C2). 



Recombination allows chromosomes to exchange genetic information and produces new 
combinations of genes, which increases the efficiency of natural selection and can be 
important in the rapid evolution of new proteins. J Genetic recombination can also be 
involved in DNA repair, particularly in the cell's response to double-strand breaks. ] 

The most common form of chromosomal crossover is homologous recombination, where the 
two chromosomes involved share very similar sequences. Non-homologous recombination 
can be damaging to cells, as it can produce chromosomal translocations and genetic 
abnormalities. The recombination reaction is catalyzed by enzymes known as recombinases, 
such as RAD51. J The first step in recombination is a double-stranded break either 
caused by an endonuclease or damage to the DNA. L J A series of steps catalyzed in part 
by the recombinase then leads to joining of the two helices by at least one Holliday 
junction, in which a segment of a single strand in each helix is annealed to the 
complementary strand in the other helix. The Holliday junction is a tetrahedral junction 
structure that can be moved along the pair of chromosomes, swapping one strand for 
another. The recombination reaction is then halted by cleavage of the junction and 
re-ligation of the released DNA. [102] 
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Evolution 

DNA contains the genetic information that allows all modern living things to function, grow 
and reproduce. However, it is unclear how long in the 4-billion-year history of life DNA has 
performed this function, as it has been proposed that the earliest forms of life may have 
used RNA as their genetic material. J L J RNA may have acted as the central part of early 
cell metabolism as it can both transmit genetic information and carry out catalysis as part 
of ribozymes. J This ancient RNA world where nucleic acid would have been used for 
both catalysis and genetics may have influenced the evolution of the current genetic code 
based on four nucleotide bases. This would occur since the number of unique bases in such 
an organism is a trade-off between a small number of bases increasing replication accuracy 
and a large number of bases increasing the catalytic efficiency of ribozymes. J 

Unfortunately, there is no direct evidence of ancient genetic systems, as recovery of DNA 
from most fossils is impossible. This is because DNA will survive in the environment for less 
than one million years and slowly degrades into short fragments in solution. J Claims for 
older DNA have been made, most notably a report of the isolation of a viable bacterium 
from a salt crystal 250-million years old, J but these claims are controversial. J L J 

Uses in technology 

Genetic engineering 

Methods have been developed to purify DNA from organisms, such as phenol-chloroform 
extraction and manipulate it in the laboratory, such as restriction digests and the 
polymerase chain reaction. Modern biology and biochemistry make intensive use of these 
techniques in recombinant DNA technology. Recombinant DNA is a man-made DNA 
sequence that has been assembled from other DNA sequences. They can be transformed 
into organisms in the form of plasmids or in the appropriate format, by using a viral 

vector. ] The genetically modified organisms produced can be used to produce products 

rim ni2i 

such as recombinant proteins, used in medical research/ J or be grown in agriculture. 1 J 

[113] 

Forensics 

Forensic scientists can use DNA in blood, semen, skin, saliva or hair found at a crime scene 
to identify a matching DNA of an individual, such as a perpetrator. This process is called 
genetic fingerprinting, or more accurately, DNA profiling. In DNA profiling, the lengths of 
variable sections of repetitive DNA, such as short tandem repeats and minisatellites, are 
compared between people. This method is usually an extremely reliable technique for 
identifying a matching DNA. J However, identification can be complicated if the scene is 
contaminated with DNA from several people. 5] DNA profiling was developed in 1984 by 
British geneticist Sir Alec Jeffreys, J and first used in forensic science to convict Colin 
Pitchfork in the 1988 Enderby murders case. 7] 

People convicted of certain types of crimes may be required to provide a sample of DNA for 
a database. This has helped investigators solve old cases where only a DNA sample was 
obtained from the scene. DNA profiling can also be used to identify victims of mass casualty 
incidents. J On the other hand, many convicted people have been released from prison on 
the basis of DNA techniques, which were not available when a crime had originally been 
committed. 
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Bioinformatics 

Bioinformatics involves the manipulation, searching, and data mining of DNA sequence 
data. The development of techniques to store and search DNA sequences have led to widely 
applied advances in computer science, especially string searching algorithms, machine 
learning and database theory. ] String searching or matching algorithms, which find an 
occurrence of a sequence of letters inside a larger sequence of letters, were developed to 
search for specific sequences of nucleotides. ] In other applications such as text editors, 
even simple algorithms for this problem usually suffice, but DNA sequences cause these 
algorithms to exhibit near-worst-case behaviour due to their small number of distinct 
characters. The related problem of sequence alignment aims to identify homologous 
sequences and locate the specific mutations that make them distinct. These techniques, 
especially multiple sequence alignment, are used in studying phylogenetic relationships and 
protein function. ] Data sets representing entire genomes' worth of DNA sequences, such 
as those produced by the Human Genome Project, are difficult to use without annotations, 
which label the locations of genes and regulatory elements on each chromosome. Regions 
of DNA sequence that have the characteristic patterns associated with protein- or 
RNA-coding genes can be identified by gene finding algorithms, which allow researchers to 
predict the presence of particular gene products in an organism even before they have been 
isolated experimentally. ] 



DNA nanotechnology 

DNA nanotechnology uses the 
unique molecular recognition 
properties of DNA and other 
nucleic acids to create 
self-assembling branched DNA 
complexes with useful 

properties. [124] DNA is thus 
used as a structural material 
rather than as a carrier of 
biological information. This 
has led to the creation of 
two-dimensional periodic 

lattices (both tile-based as well 
as using the "DNA origami" 
method) as well as 
three-dimensional structures 
in the shapes of 




The DNA structure at left (schematic shown) will self-assemble into 

the structure visualized by atomic force microscopy at right. DNA 

nanotechnology is the field which seeks to design nanoscale structures 

using the molecular recognition properties of DNA molecules. Image 

from Strong, 2004.[123] 



polyhedra. 125] Nanomechanical devices and algorithmic self-assembly have also been 
demonstrated, J and these DNA structures have been used to template the arrangement 
of other molecules such as gold nanoparticles and streptavidin proteins. 127 - 1 
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History and anthropology 

Because DNA collects mutations over time, which are then inherited, it contains historical 
information and by comparing DNA sequences, geneticists can infer the evolutionary 
history of organisms, their phylogeny. J This field of phylogenetics is a powerful tool in 
evolutionary biology. If DNA sequences within a species are compared, population 
geneticists can learn the history of particular populations. This can be used in studies 
ranging from ecological genetics to anthropology; for example, DNA evidence is being used 
to try to identify the Ten Lost Tribes of Israel. [129] [130] 

DNA has also been used to look at modern family relationships, such as establishing family 
relationships between the descendants of Sally Hemings and Thomas Jefferson. This usage 
is closely related to the use of DNA in criminal investigations detailed above. Indeed, some 
criminal investigations have been solved when DNA from crime scenes has matched 
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relatives of the guilty individual. 1 J 



History of DNA research 

DNA was first isolated by the Swiss physician Friedrich Miescher who, in 1869, discovered 
a microscopic substance in the pus of discarded surgical bandages. As it resided in the 
nuclei of cells, he called it "nuclein". J In 1919, Phoebus Levene identified the base, 
sugar and phosphate nucleotide unit. ] Levene suggested that DNA consisted of a string 
of nucleotide units linked together through the phosphate groups. However, Levene 
thought the chain was short and the bases repeated in a fixed order. In 1937 William 
Astbury produced the first X-ray diffraction patterns that showed that DNA had a regular 
structure. [1 34] 

In 1928, Frederick Griffith discovered that traits of the "smooth" form of the Pneumococcus 
could be transferred to the "rough" form of the same bacteria by mixing killed "smooth" 
bacteria with the live "rough" form. 5] This system provided the first clear suggestion that 
DNA carried genetic information— the Avery-MacLeod-McCarty experiment— when Oswald 
Avery, along with coworkers Colin MacLeod and Maclyn McCarty, identified DNA as the 
transforming principle in 1943. J DNA's role in heredity was confirmed in 1952, when 
Alfred Hershey and Martha Chase in the Hershey-Chase experiment showed that DNA is 
the genetic material of the T2 phage. ] 




James D. Watson 



Rosalind Franklin 
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DNA Helix controversy 



Raymond Gosling 



In 1953 James D. Watson and Francis Crick suggested what is now accepted as the first 
correct double-helix model of DNA structure in the journal Nature} J Their double-helix, 
molecular model of DNA was then based on a single X-ray diffraction image (labeled as 
"Photo 51 "y J taken by Rosalind Franklin and Raymond Gosling in May 1952, as well as 
the information that the DNA bases were paired— also obtained through private 
communications from Erwin Chargaff in the previous years. Chargaffs rules played a very 
important role in establishing double-helix configurations for B-DNA as well as A-DNA. 

Experimental evidence supporting the Watson and Crick model were published in a series 
of five articles in the same issue of Nature} J Of these, Franklin and Gosling's paper was 
the first publication of their own X-ray diffraction data and original analysis method that 
partially supported the Watson and Crick model L J L J ; this issue also contained an article 
on DNA structure by Maurice Wilkins and two of his colleagues, whose analysis and in vivo 
B-DNA X-ray patterns also supported the presence in vivo of the double-helical DNA 
configurations as proposed by Crick and Watson for their double-helix molecular model of 
DNA in the previous two pages of Nature} J In 1962, after Franklin's death, Watson, Crick, 
and Wilkins jointly received the Nobel Prize in Physiology or Medicine. ] Unfortunately, 
Nobel rules of the time allowed only living recipients, but a vigorous debate continues on 
who should receive credit for the discovery. ] 

In an influential presentation in 1957, Crick laid out the "Central Dogma" of molecular 
biology, which foretold the relationship between DNA, RNA, and proteins, and articulated 
the "adaptor hypothesis". ] Final confirmation of the replication mechanism that was 
implied by the double-helical structure followed in 1958 through the Meselson-Stahl 
experiment. ] Further work by Crick and coworkers showed that the genetic code was 
based on non-overlapping triplets of bases, called codons, allowing Har Gobind Khorana, 



Robert W. Holley and Marshall Warren Nirenberg to decipher the genetic code, 
findings represent the birth of molecular biology. 



[145] 
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Molecular models of DNA 

Molecular models of DNA structures are representations of the molecular geometry and 
topology of Deoxyribonucleic acid (DNA) molecules using one of several means, such as: 
closely packed spheres (CPK models) made of plastic, metal wires for 'skeletal models', 
graphic computations and animations by computers, artistic rendering, and so on, with the 
aim of simplifying and presenting the essential, physical and chemical, properties of DNA 
molecular structures either in vivo or in vitro. Computer molecular models also allow 
animations and molecular dynamics simulations that are very important for understanding 
how DNA functions in vivo. Thus, an old standing dynamic problem is how DNA 
"self-replication" takes place in living cells that should involve transient uncoiling of 
supercoiled DNA fibers. Although DNA consists of relatively rigid, very large elongated 
biopolymer molecules called "fibers" or chains (that are made of repeating nucleotide units 
of four basic types, attached to deoxyribose and phosphate groups), its molecular structure 
in vivo undergoes dynamic configuration changes that involve dynamically attached water 
molecules and ions. Supercoiling, packing with histones in chromosome structures, and 
other such supramolecular aspects also involve in vivo DNA topology which is even more 
complex than DNA molecular geometry, thus turning molecular modeling of DNA into an 
especially challenging problem for both molecular biologists and biotechnologists. Like 
other large molecules and biopolymers, DNA often exists in multiple stable geometries (that 
is, it exhibits conformational isomerism) and configurational, quantum states which are 
close to each other in energy on the potential energy surface of the DNA molecule. Such 
geometries can also be computed, at least in principle, by employing ab initio quantum 
chemistry methods that have high accuracy for small molecules. Such quantum geometries 
define an important class of ab initio molecular models of DNA whose exploration has 
barely started. 

In an interesting twist of roles, the DNA molecule itself was proposed to 
be utilized for quantum computing. Both DNA nanostructures as well as 
DNA 'computing' biochips have been built (see biochip image at right). 

The more advanced, computer-based molecular models of DNA involve 
molecular dynamics simulations as well as quantum mechanical 
computations of vibro-rotations, delocalized molecular orbitals (MOs), 
electric dipole moments, hydrogen-bonding, and so on. 




DNA computing 
biochip: 3D 
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Importance 

From the very early stages of structural studies of DNA by X-ray 
diffraction and biochemical means, molecular models such as the 
Watson-Crick double-helix model were successfully employed to solve the 
'puzzle' of DNA structure, and also find how the latter relates to its key 
functions in living cells. The first high quality X-ray diffraction patterns 
of A-DNA were reported by Rosalind Franklin and Raymond Gosling in 
1953 L . The first calculations of the Fourier transform of an atomic helix 
were reported one year earlier by Cochran, Crick and Vand [2] , and were 
followed in 1953 by the computation of the Fourier transform of a 
coiled-coil by Crick [3] . The first reports of a double-helix molecular 
model of B-DNA structure were made by Watson and Crick in 1953 L J L . 
Last-but-not-least, Maurice F. Wilkins, A. Stokes and H.R. Wilson, 
reported the first X-ray patterns of in vivo B-DNA in partially oriented 
salmon sperm heads [6] . The development of the first correct 
double-helix molecular model of DNA by Crick and Watson may not have 

been possible without the biochemical evidence for the nucleotide base-pairing ([A— T]; 

[C-G]), or Chargaff s rules [7] [8] [9] [10] [11] [12] . 




Spinning DNA 
generic model. 



Examples of DNA molecular models 

Animated molecular models allow one to visually explore the three-dimensional (3D) 
structure of DNA. The first DNA model is a space-filling, or CPK, model of the DNA 
double-helix whereas the third is an animated wire, or skeletal type, molecular model of 
DNA. The last two DNA molecular models in this series depict quadruplex DNA L J that 
may be involved in certain cancers [ ] [ 5] . The last figure on this panel is a molecular 
model of hydrogen bonds between water molecules in ice that are similar to those found in 
DNA. 
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Hydrogen 




• Spacefilling model or CPK model - a molecule is represented by overlapping spheres 
representing the atoms. 
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Images for DNA Structure Determination from X-Ray 
Patterns 

The following images illustrate both the principles and the main steps involved in 
generating structural information from X-ray diffraction studies of oriented DNA fibers with 
the help of molecular models of DNA that are combined with crystallographic and 
mathematical analysis of the X-ray patterns. From left to right the gallery of images shows: 

• First row: 

• 1. Constructive X-ray interference, or diffraction, following Bragg's Law of X-ray 
"reflection by the crystal planes"; 

• 2. A comparison of A-DNA (crystalline) and highly hydrated B-DNA (paracrystalline) X-ray 
diffraction, and respectively, X-ray scattering patterns (courtesy of Dr. Herbert R. Wilson, 
FRS- see refs. list); 

• 3. Purified DNA precipitated in a water jug; 

• 4. The major steps involved in DNA structure determination by X-ray crystallography 
showing the important role played by molecular models of DNA structure in this iterative, 
structure-determination process; 

• Second row: 
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• 5. Photo of a modern X-ray diffractometer employed for recording X-ray patterns of DNA 
with major components: X-ray source, goniometer, sample holder, X-ray detector and/or 
plate holder; 

• 6. Illustrated animation of an X-ray goniometer; 

• 7. X-ray detector at the SLAC synchrotron facility; 

• 8. Neutron scattering facility at ISIS in UK; 

• Third and fourth rows: Molecular models of DNA structure at various scales; figure 
#11 is an actual electron micrograph of a DNA fiber bundle, presumably of a single 
bacterial chromosome loop. 
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Paracrystalline lattice models of B-DNA structures 

A paracrystalline lattice, or paracrystal, is a molecular or atomic lattice with significant 
amounts (e.g., larger than a few percent) of partial disordering of molecular 
arranegements. Limiting cases of the paracrystal model are nanostructures, such as 
glasses, liquids, etc., that may possess only local ordering and no global order. Liquid 
crystals also have paracrystalline rather than crystalline structures. 
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DNA Helix controversy in 1952 
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Highly hydrated B-DNA occurs naturally in living cells in such a paracrystalline state, which 
is a dynamic one in spite of the relatively rigid DNA double-helix stabilized by parallel 
hydrogen bonds between the nucleotide base-pairs in the two complementary, helical DNA 
chains (see figures). For simplicity most DNA molecular models ommit both water and ions 
dynamically bound to B-DNA, and are thus less useful for understanding the dynamic 
behaviors of B-DNA in vivo. The physical and mathematical analysis of X-ray 1 - J L J and 
spectroscopic data for paracrystalline B-DNA is therefore much more complicated than that 
of crystalline, A-DNA X-ray diffraction patterns. The paracrystal model is also important for 
DNA technological applications such as DNA nanotechnology. Novel techniques that 
combine X-ray diffraction of DNA with X-ray microscopy in hydrated living cells are now 
also being developed (see, for example, "Application of X-ray microscopy in the analysis of 
living hydrated cells" [18] ). 

Genomic and Biotechnology Applications of DNA molecular 
modeling 

The following gallery of images illustrates various uses of DNA molecular modeling in 
Genomics and Biotechnology research applications from DNA repair to PCR and DNA 
nanostructures; each slide contains its own explanation and/or details. The first slide 
presents an overview of DNA applications, including DNA molecular models, with emphasis 
on Genomics and Biotechnology. 

Gallery: DNA Molecular modeling applications 
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Databases for DNA molecular models and sequences 



X-ray diffraction 

• NDB ID: UD0017 Database [13] 

• X-ray Atlas -database [19] 

• PDB files of coordinates for nucleic acid structures from X-ray diffraction by NA (incl. 
DNA) crystals [20] 

• Structure factors dowloadable files in CIF format [21] 
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Neutron scattering 

• ISIS neutron source 

• ISIS pulsed neutron source:A world centre for science with neutrons & muons at 
Harwell, near Oxford, UK. [22] 

X-ray microscopy 

• Application of X-ray microscopy in the analysis of living hydrated cells L J 

Electron microscopy 

• DNA under electron microscope [ ] 

Atomic Force Microscopy (AFM) 

Two-dimensional DNA junction arrays have been visualized by Atomic Force Microscopy 
(AFM) L J . Other imaging resources for AFM/Scanning probe microscopy(SPM) can be 
freely accessed at: 

• How SPM Works [25] 

• SPM Image Gallery - AFM STM SEM MFM NSOM and more. [26] 

Gallery of AFM Images 




Sample Surface 
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Mass spectrometry—Maldi informatics 
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Spectroscopy 

• Vibrational circular dichroism (VCD) 

• FT-NMR [27] [28] 

• NMR Atlas-database [29] 

• mmcif downloadable coordinate files of nucleic acids in solution from 2D-FT NMR data 

[30] 

• NMR constraints files for NAs in PDB format [31] 
NMR microscopy [32] 
Microwave spectroscopy 
FT-IR 

p T _ NIR [33] [34] [35] 

Spectral, Hyperspectral, and Chemical imaging) [36] [37] [38] [39] [40] [41] [42] . 

Raman spectroscopy/microscopy [ ] and CARS [ ] . 

Fluorescence correlation spectroscopy [45] [46] [47] [48] [49] [50] [51] [52] , Fluorescence 

cross-correlation spectroscopy and FRET [5 ] [5 ] [55] . 

Confocal microscopy^ J 
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Gallery: CARS (Raman spectroscopy), Fluorescence confocal 
microscopy, and Hyperspectral imaging 
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Genomic and structural databases 

• CBS Genome Atlas Database L J — contains examples of base skews. J 

• The Z curve database of genomes — a 3-dimensional visualization and analysis tool of 
genomes [59][60] . 

• DNA and other nucleic acids' molecular models: Coordinate files of nucleic acids 
molecular structure models in PDB and CIF formats L J 
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External links 

• DNA the Double Helix Game (http://nobelprize.org/educational_games/medicine/ 
dnadoublehelix/) From the official Nobel Prize web site 

• MDDNA: Structural Bioinformatics of DNA (http://humphry.chem. wesleyan.edu:8080/ 
MDDNA/) 

• Double Helix 1953-2003 (http://www.ncbe.reading.ac.uk/DNA50/) National Centre 
for Biotechnology Education 

• DNA under electron microscope (http://www.fidelitysystems.com/Unlinked_DNA. 
html) 

• Ascalaph DNA (http://www.agilemolecule.com/Ascalaph/Ascalaph_DNA.html) — 
Commercial software for DNA modeling 

• DNAlive: a web interface to compute DNA physical properties (http://mmb.pcb.ub.es/ 
DNAlive). Also allows cross-linking of the results with the UCSC Genome browser and 
DNA dynamics. 

• DiProDB: Dinucleotide Property Database (http://diprodb.fli-leibniz.de). The database 
is designed to collect and analyse thermodynamic, structural and other dinucleotide 
properties. 

• Further details of mathematical and molecular analysis of DNA structure based on X-ray 
data (http ://planetphysics . org/encyclopedia/ 
BesselFunctionsApplicationsToDiffractionByHelicalStructures.html) 

• Bessel functions corresponding to Fourier transforms of atomic or molecular helices. 
(http://planetphysics.org/?op=getobj&from=objects& 
name=BesselFunctionsAndTheirApplicationsToDiffractionByHelicalStructures) 

• Application of X-ray microscopy in analysis of living hydrated cells (http://www.ncbi. 
nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract& 
list_uids=12379938) 

• Characterization in nanotechnology some pdfs (http://nanocharacterization.sitesled. 
com/) 

• overview of STM/AFM/SNOM principles with educative videos (http://www.ntmdt.ru/ 
SPM-Techniques/Principles/) 

• SPM Image Gallery - AFM STM SEM MFM NSOM and More (http://www.rhk-tech.com/ 
results/showcase. php) 

• How SPM Works (http://www.parkafm.com/New_html/resources/01general.php) 

• U.S. National DNA Day (http://www.genome.gov/10506367) — watch videos and 
participate in real-time discussions with scientists. 

• The Secret Life of DNA - DNA Music compositions (http://www.tjmitchell.com/stuart/ 
dna.html) 
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DNA microarray 




Example of an approximately 40,000 probe spotted oligo microarray 
with enlarged inset to show detail. 



For terminology, see 
glossary below. 

A DNA microarray is a 

multiplex technology used in 

molecular biology and in 

medicine. It consists of an 

arrayed series of thousands of 

microscopic spots of DNA 

oligonucleotides, called 

features, each containing 

picomoles of a specific DNA 

sequence. This can be a short 

section of a gene or other DNA 

element that are used as 

probes to hybridize a cDNA or 

cRNA sample (called target) under high-stringency conditions. Probe-target hybridization is 

usually detected and quantified by detection of fluorophore-, silver-, or 

chemiluminescence-labeled targets to determine relative abundance of nucleic acid 

sequences in the target. 

In standard microarrays, the probes are attached to a solid surface by a covalent bond to a 
chemical matrix (via epoxy-silane, amino-silane, lysine, polyacrylamide or others). The solid 
surface can be glass or a silicon chip, in which case they are commonly known as gene chip 
or colloquially Affy chip when an Affymetrix chip is used. Other microarray platforms, such 
as Illumina, use microscopic beads, instead of the large solid support. DNA arrays are 
different from other types of microarray only in that they either measure DNA or use DNA 
as part of its detection system. 

DNA microarrays can be used to measure changes in expression levels, to detect single 
nucleotide polymorphisms (SNPs) (see uses and types section), in genotyping or in 
resequencing mutant genomes. Microarrays also differ in fabrication, workings, accuracy, 
efficiency, and cost (see fabrication section). Additional factors for microarray experiments 
are the experimental design and the methods of analyzing the data (see Bioinformatics 
section). 



History 

Microarray technology evolved from Southern blotting, where fragmented DNA is attached 
to a substrate and then probed with a known gene or fragment. The use of a collection of 
distinct DNAs in arrays for expression profiling was first described in 1987, and the arrayed 
DNAs were used to identify genes whose expression is modulated by interferon. 1] These 
early gene arrays were made by spotting cDNAs onto filter paper with a pin-spotting 
device. The use of miniaturized microarrays for gene expression profiling was first reported 
in 1995, L J and a complete eukaryotic genome (Saccharomyces cerevisiae) on a microarray 
was published in 1997. [3] 
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Uses and types 

Arrays of DNA can be spatially arranged, as in the commonly 
known gene chip (also called genome chip, DNA chip or gene 
array), or can be specific DNA sequences labelled such that 
they can be independently identified in solution. The 
traditional solid-phase array is a collection of microscopic 
DNA spots attached to a solid surface, such as glass, plastic 
or silicon biochip. The affixed DNA segments are known as 
probes (although some sources use different terms such as 
reporters). Thousands of them can be placed in known 
locations on a single DNA microarray. 




Two Affymetrix chips 



DNA microarrays can be used to detect DNA (as in comparative genomic hybridization), or 
detect RNA (most commonly as cDNA after reverse transcription) that may or may not be 
translated into proteins. The process of measuring gene expression via cDNA is called 
expression analysis or expression profiling. 

Since an array can contain tens of thousands of probes, a microarray experiment can 
accomplish that many genetic tests in parallel. Therefore arrays have dramatically 
accelerated many types of investigation. 

Applications include: 



Technology or Application 


Synopsis 


Gene expression profiling 


In an mRNA or gene expression profiling experiment the expression levels of 
thousands of genes are simultaneously monitored to study the effects of certain 
treatments, diseases, and developmental stages on gene expression. For 
example, microarray-based gene expression profiling can be used to identify 
genes whose expression is changed in response to pathogens or other 
organisms by comparing gene expression in infected to that in uninfected cells 
or tissues. 


Comparative genomic 
hybridization 


Assessing genome content in different cells or closely related organisms. 


GenelD 


Small microarrays to check IDs of organisms in food and feed (like GMO [7]), 
mycoplasms in cell culture, or pathogens for disease detection, mostly 
combining PCR and microarray technology. 


Chromatin immimoprecipitation 
on Chip 


DNA sequences bound to a particular protein can be isolated by 
immunoprecipitating that protein (ChIP), these fragments can be then 
hybridized to a microarray (such as a tiling array) allowing the determination of 
protein binding site occupancy throughout the genome. Example protein to 
immunoprecipitate are histone modifications (H3K27me3, H3K4me2, 
H3K9me3, etc), Polycomb-group protein (PRC2:Suzl2, PRC1:YY1) and 
trithorax-group protein (Ashl) to study the epigenetic landscape or RNA 
Polymerase II to study the transcription landscape. 


SNP detection 


Identifying single nucleotide polymorphism among alleles within or between 
populations. Several applications of microarrays make use of SNP detection, 
including Genotyping, forensic analysis, measuring predisposition to disease, 
identifying drug-candidates, evaluating germline mutations in individuals or 
somatic mutations in cancers, assessing loss of heterozygosity, or genetic 
linkage analysis. 
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Alternative splicing detection 


An 'exon junction array design uses probes specific to the expected or potential 
splice sites of predicted exons for a gene. It is of intermediate density, or 
coverage, to a typical gene expression array (with 1-3 probes per gene) and a 
genomic tiling array (with hundreds or thousands of probes per gene). It is used 
to assay the expression of alternative splice forms of a gene. Exon arrays have a 
different design, employing probes designed to detect each individual exon for 
known or predicted genes, and can be used for detecting different splicing 
isoforms. 


Fusion genes microarray 


A Fusion gene microarray can detect fusion transcripts, e.g. from cancer 
specimens. The principle behind this is building on the alternative splicing 
micorrays. The oligo design strategy enables combined measurements of 
chimeric transcript junctions with exon-wise measurements of individual fusion 
partners. 


Tiling array 


Genome tiling arrays consist of overlapping probes designed to densely 
represent a genomic region of interest, sometimes as large as an entire human 
chromosome. The purpose is to empirically detect expression of transcripts or 
alternatively splice forms which may not have been previously known or 
predicted. 



Fabrication 

Microarrays can be manufactured in different ways, depending on the number of probes 
under examination, costs, customization requirements, and the type of scientific question 
being asked. Arrays may have as few as 10 probes or up to 2.1 million (NimbleGen, Roche) 
micrometre-scale probes from commercial vendors. 

Surface engineering 

The first step of DNA microarray fabrication involves surface engineering of a substrate in 
order to obtain desirable surface properties for the application of interest. Optimal surface 
properties are those which produce high signal to noise ratios for the DNA targets of 
interest. Generally, this involves maximizing the probe surface density and activity while 
minimizing the non-specific binding of the targets of interest. Methods of surface 
engineering vary depending on the platform material, design, and application. 



Spotted vs. oligonucleotide arrays 

Microarrays can be fabricated using a variety of technologies, including printing with 
fine-pointed pins onto glass slides, photolithography using pre-made masks, 
photolithography using dynamic micromirror devices, ink-jet printing, J or electrochemistry 
on microelectrode arrays. 

In spotted microarrays, the probes are oligonucleotides, cDNA or small fragments of PCR 
products that correspond to mRNAs. The probes are synthesized prior to deposition on the 
array surface and are then "spotted" onto glass. A common approach utilizes an array of 
fine pins or needles controlled by a robotic arm that is dipped into wells containing DNA 
probes and then depositing each probe at designated locations on the array surface. The 
resulting "grid" of probes represents the nucleic acid profiles of the prepared probes and is 
ready to receive complementary cDNA or cRNA "targets" derived from experimental or 
clinical samples. This technique is used by research scientists around the world to produce 
"in-house" printed microarrays from their own labs. These arrays may be easily customized 
for each experiment, because researchers can choose the probes and printing locations on 
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the arrays, synthesize the probes in their own lab (or collaborating facility), and spot the 
arrays. They can then generate their own labeled samples for hybridization, hybridize the 
samples to the array, and finally scan the arrays with their own equipment. This provides a 
relatively low-cost microarray that may be customized for each study, and avoids the costs 
of purchasing often more expensive commercial arrays that may represent vast numbers of 
genes that are not of interest to the investigator. Publications exist which indicate in-house 
spotted microarrays may not provide the same level of sensitivity compared to commercial 
oligonucleotide arrays/ ] possibly owing to the small batch sizes and reduced printing 
efficiencies when compared to industrial manufactures of oligo arrays. 

In oligonucleotide microarrays, the probes are short sequences designed to match parts of 
the sequence of known or predicted open reading frames. Although oligonucleotide probes 
are often used in "spotted" microarrays, the term "oligonucleotide array" most often refers 
to a specific technique of manufacturing. Oligonucleotide arrays are produced by printing 
short oligonucleotide sequences designed to represent a single gene or family of gene 
splice-variants by synthesizing this sequence directly onto the array surface instead of 
depositing intact sequences. Sequences may be longer (60-mer probes such as the Agilent 
design) or shorter (25-mer probes produced by Affymetrix) depending on the desired 
purpose; longer probes are more specific to individual target genes, shorter probes may be 
spotted in higher density across the array and are cheaper to manufacture. One technique 
used to produce oligonucleotide arrays include photolithographic synthesis (Agilent and 
Affymetrix) on a silica substrate where light and light-sensitive masking agents are used to 
"build" a sequence one nucleotide at a time across the entire array. 11] Each applicable 
probe is selectively "unmasked" prior to bathing the array in a solution of a single 
nucleotide, then a masking reaction takes place and the next set of probes are unmasked in 
preparation for a different nucleotide exposure. After many repetitions, the sequences of 
every probe become fully constructed. More recently, Maskless Array Synthesis from 
NimbleGen Systems has combined flexibility with large numbers of probes. J 
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Two-channel vs. one-channel detection 

Two-color microarrays or two-channel microarrays are 
typically hybridized with cDNA prepared from two 
samples to be compared (e.g. diseased tissue versus 
healthy tissue) and that are labeled with two different 
fluorophores. ] Fluorescent dyes commonly used for 
cDNA labelling include Cy3, which has a fluorescence 
emission wavelength of 570 nm (corresponding to the 
green part of the light spectrum), and Cy5 with a 
fluorescence emission wavelength of 670 nm 
(corresponding to the red part of the light spectrum). 
The two Cy-labelled cDNA samples are mixed and 
hybridized to a single microarray that is then scanned 
in a microarray scanner to visualize fluorescence of the 
two fluorophores after excitation with a laser beam of a 
defined wavelength. Relative intensities of each 
fluorophore may then be used in ratio-based analysis to 
identify up-regulated and down-regulated genes. ] 



Cancer Cells 



Normal Cells 




"Red F! cures cent" Probes '"Green Fluorescent" Probes 

Combine Targets 




Diagram of typical dual-colour 
microarray experiment. 



Oligonucleotide microarrays often contain control 
probes designed to hybridize with RNA spike-ins. The 

degree of hybridization between the spike-ins and the control probes is used to normalize 
the hybridization measurements for the target probes. Although absolute levels of gene 
expression may be determined in the two-color array, the relative differences in expression 
among different spots within a sample and between samples is the preferred method of 
data analysis for the two-color system. Examples of providers for such microarrays includes 
Agilent with their Dual-Mode platform, Eppendorf with their DualChip platform for 
colorimetric Silverquant labeling, and TeleChem International with Arrayit. 

In single-channel microarrays or one-color microarrays, the arrays are designed to give 
estimations of the absolute levels of gene expression. Therefore the comparison of two 
conditions requires two separate single-dye hybridizations. As only a single dye is used, the 
data collected represent absolute values of gene expression. These may be compared to 
other genes within a sample or to reference "normalizing" probes used to calibrate data 
across the entire array and across multiple arrays. Three popular single-channel systems 
are the Affymetrix "Gene Chip", the Applied Microarrays "CodeLink" arrays, and the 
Eppendorf "DualChip & Silverquant". One strength of the single-dye system lies in the fact 
that an aberrant sample cannot affect the raw data derived from other samples, because 
each array chip is exposed to only one sample (as opposed to a two-color system in which a 
single low-quality sample may drastically impinge on overall data precision even if the other 
sample was of high quality). Another benefit is that data are more easily compared to 
arrays from different experiments; the absolute values of gene expression may be compared 
between studies conducted months or years apart. A drawback to the one-color system is 
that, when compared to the two-color system, twice as many microarrays are needed to 
compare samples within an experiment. 
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Microarrays and bioinformatics 

The advent of inexpensive microarray experiments created 
several specific bioinformatics challenges: 

• the multiple levels of replication in experimental design 
(Experimental design) 

• the number of platforms and independent groups and 
data format (Standardization) 

• the treatment of the data (Statistical analysis) 

• accuracy and precision (Relation between probe and 
gene) 

• the sheer volume of data and the ability to share it (Data 
warehousing) 

Experimental design 
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Gene expression values from 

microarray experiments can be 

represented as heat maps to 

visualize the result of data 

analysis. 



Due to the biological complexity of gene expression, the 

considerations of experimental design that are discussed in 

the expression profiling article are of critical importance if statistically and biologically 

valid conclusions are to be drawn from the data. 

There are three main elements to consider when designing a microarray experiment. First, 
replication of the biological samples is essential for drawing conclusions from the 
experiment. Second, technical replicates (two RNA samples obtained from each 
experimental unit) help to ensure precision and allow for testing differences within 
treatment groups. The technical replicates may be two independent RNA extractions or two 
aliquots of the same extraction. Third, spots of each cDNA clone or oligonucleotide are 
present as replicates (at least duplicates) on the microarray slide, to provide a measure of 
technical precision in each hybridization. It is critical that information about the sample 
preparation and handling is discussed, in order to help identify the independent units in the 
experiment and to avoid inflated estimates of statistical significance. 15] 



Standardization 

Microarray data is difficult to exchange due to the lack of standardization in platform 
fabrication, assay protocols, and analysis methods. This presents an interoperability 
problem in bioinformatics. Various grass-roots open-source projects are trying to ease the 
exchange and analysis of data produced with non-proprietary chips: 

• For example, the "Minimum Information About a Microarray Experiment" (MIAME) 
checklist helps define the level of detail that should exist and is being adopted by many 
journals as a requirement for the submission of papers incorporating microarray results. 
But MIAME does not describe the format for the information, so while many formats can 
support the MIAME requirements, as of 2007 no format permits verification of complete 
semantic compliance. 

• The "MicroArray Quality Control (MAQC) Project" is being conducted by the US Food and 
Drug Administration (FDA) to develop standards and quality control metrics which will 
eventually allow the use of MicroArray data in drug discovery, clinical practice and 
regulatory decision-making. J 
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• The MGED Society has developed standards for the representation of gene expression 
experiment results and relevant annotations. 

Statistical analysis 

The analysis of DNA microarrays poses a large number of statistical problems, including the 
normalization of the data. There are several normalization methods in the published 
literature some of which are platform specific; as in many other cases where authorities 
disagree, a sound conservative approach is to directly compare different normalization 
methods to determine the effects of these different methods on the results obtained. This 
can be done, for example, by investigating the performance of various methods on data 
from "spike-in" experiments. 

Also, experimenters must account for multiple comparisons: even if the statistical P-value 
assigned to a gene indicates that it is extremely unlikely that differential expression of this 
gene was due to random rather than treatment effects, the very high number of genes on 
an array makes it likely that differential expression of some genes represent false positives 
or false negatives. Statistical methods tailored to microarray analyses have recently 
become available that assess statistical power based on the variation present in the data 
and the number of experimental replicates, and can help minimize type I and type II errors 
in the analyses. J 

A basic difference between microarray data analysis and much traditional biomedical 
research is the dimensionality of the data. A large clinical study might collect 100 data 
items per patient for thousands of patients. A medium-size microarray study will obtain 
many thousands of numbers per sample for perhaps a hundred samples. Many analysis 
techniques treat each sample as a single point in a space with thousands of dimensions, 
then attempt by various techniques to reduce the dimensionality of the data to something 
humans can visualize. J An example for such a method is the Local Pooled Error (LPE) 
test, which pools standard deviations of genes with similar expression levels and thereby 
overcomes the problem of low replicate numbers. J 

Relation between probe and gene 

The relation between a probe and the mRNA that it is expected to detect is problematic. On 
the one hand, some mRNAs may cross-hybridize probes in the array that are supposed to 
detect another mRNA. On the other hand, probes that are designed to detect the mRNA of a 
particular gene may be relying on genomic EST information that is incorrectly associated 
with that gene. 
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Data warehousing 

Microarray data was found to be more useful when compared to other similar datasets. The 
sheer volume (in bytes), specialized formats (such as MIAME), and curation efforts 
associated with the datasets require specialized databases to store the data. 

See also 

• Systems biology 

• Microfluidics or lab-on-chip 

• Cyanine dyes, such as Cy3 and Cy5, are commonly used fluorophores with microarrays 

• Serial analysis of gene expression 

• Significance analysis of microarrays 

• Full Genome Sequencing 
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Glossary 

• An Array or slide is a collection of features spatially arranged in a two dimensional grid, 
arranged in columns and rows. 

• Block or subarray: a group of spots, typically made in one print round; several 
subarrays/blocks form an array. 

• Case/control: an experimental design paradigm especially suited to the two-colour array 
system, in which a condition chosen as control (such as healthy tissue or state) is 
compared to an altered condition (such as a diseased tissue or state). 

• Channel: the fluorescence output recorded in the scanner for an individual fluorophore 
and can even be ultraviolet. 

• Dye flip or Dye swap or Fluor reversal: reciprocal labelling of DNA targets with the 
two dyes to account for dye bias in experiments. 

• Scanner: an instrument used to detect and quantify the intensity of fluorescence of spots 
on a microarray slide, by selectively exciting fluorophores with a laser and measuring the 
fluorescence with a filter (optics) photomultiplier system. 

• Spot or feature: a small area on an array slide that contains picomoles of specific DNA 
samples. 

• For other relevant terms see: 

Glossary of gene expression terms 
Protocol (natural sciences) 
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External links 

• Many important links can be found at the Open Directory Project 

• Gene Expression (http://www.dmoz.org/Science/Biology/ 
BiochemistryandMolecularBiology/GeneExpression/) at the Open Directory 
Project 

• Micro Scale Products and Services for Biochemistry and Molecular Biology (http:// 
www.dmoz.org/Science/Biology/Biochemistry_and_Molecular_Biology/ 
ProductsandServices/MicroScale/) at the Open Directory Project 

• Products and Services for Gene Expression (http://www.dmoz.org/Science/Biology/ 
BiochemistryandMolecularBiology/GeneExpression/ProductsandServices/) at 
the Open Directory Project 

• PLoS Biology Primer: Microarray Analysis (http://biology.plosjournals.org/perlserv/ 
?request=get-document&doi= 10.1 37 1/journal.pbio. 00000 15) 

• Rundown of microarray technology (http://www.genome.gov/page. 
cfm?pageID=10000533) 

• ArrayMining.net (http://www.arraymining.net) - a free web-server for online 
microarray analysis 

• CLASSIFI (http://pathcuricl.swmed.edu/pathdb/classifi.html) - Gene Ontology-based 
gene cluster classification resource 

• Microarray - How does it work? (http://www.unsolvedmysteries.oregonstate.edu/ 
microarray_07) 

• Microarray data processing using Self-Organizing Maps tutorial: Part 1 (http://blog. 
peltarion.com/2007/04/10/the-self-organized-gene-part-l) Part 2 (http://blog. 
peltarion . com/2 7/06/ 1 3/the-self-organized-gene-part- 2 ) 
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DNA sequencing 

The term DNA sequencing refers to methods for determining the order of the nucleotide 
bases, adenine, guanine, cytosine, and thymine, in a molecule of DNA. The first DNA 
sequences were obtained by academic researchers, using laborious methods based on 
2-dimensional chromatography in the early 1970s. Following the development of dye-based 
sequencing methods with automated analysis, DNA sequencing has become easier and 
orders of magnitude faster. Knowledge of DNA sequences of genes and other parts of the 
genome of organisms has become indispensable for basic research studying biological 
processes, as well as in applied fields such as diagnostic or forensic research. The advent of 
DNA sequencing has significantly accelerated biological research and discovery. The rapid 
speed of sequencing attained with modern DNA sequencing technology has been 
instrumental in the sequencing of the human genome, in the Human Genome Project. 
Related projects, often by scientific collaboration across continents, have generated the 
complete DNA sequences of many animal, plant, and microbial genomes. 

RNA sequencing, which is 
technically easier to perform 
than DNA sequencing, was 
one of the earliest forms of 
nucleotide sequencing. The 

major landmark of RNA DNA Sequence Trace 



sequencing is the sequence of 

the first complete gene and the complete genome of Bacteriophage MS2, identified and 
published by Walter Fiers and his coworkers at the University of Ghent (Ghent, Belgium), 
between 1972 [1] and 1976. [2] 

Prior to the development of rapid DNA sequencing methods in the early 1970s by Frederick 
Sanger at the University of Cambridge, in England and Walter Gilbert and Allan Maxam at 
Harvard, ] [ ] a number of laborious methods were used. For instance, in 1973, Gilbert and 
Maxam reported the sequence of 24 basepairs using a method known as wandering-spot 
analysis. [5] 

The chain-termination method developed by Sanger and coworkers in 1975 soon became 
the method of choice, owing to its relative ease and reliability. J L J 

Maxam-Gilbert sequencing 

In 1976-1977, Allan Maxam and Walter Gilbert developed a DNA sequencing method based 
on chemical modification of DNA and subsequent cleavage at specific bases. J Although 
Maxam and Gilbert published their chemical sequencing method two years after the 
ground-breaking paper of Sanger and Coulson on plus-minus sequencing, ] [ ] 
Maxam-Gilbert sequencing rapidly became more popular, since purified DNA could be used 
directly, while the initial Sanger method required that each read start be cloned for 
production of single-stranded DNA. However, with the improvement of the 
chain-termination method (see below), Maxam-Gilbert sequencing has fallen out of favour 
due to its technical complexity prohibiting its use in standard molecular biology kits, 
extensive use of hazardous chemicals, and difficulties with scale-up. 
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The method requires radioactive labelling at one end and purification of the DNA fragment 
to be sequenced. Chemical treatment generates breaks at a small proportion of one or two 
of the four nucleotide bases in each of four reactions (G, A+G, C, C+T). Thus a series of 
labelled fragments is generated, from the radiolabeled end to the first 'cut' site in each 
molecule. The fragments in the four reactions are arranged side by side in gel 
electrophoresis for size separation. To visualize the fragments, the gel is exposed to X-ray 
film for autoradiography, yielding a series of dark bands each corresponding to a 
radiolabeled DNA fragment, from which the sequence may be inferred. 

Also sometimes known as 'chemical sequencing 1 , this method originated in the study of 
DNA-protein interactions (footprinting), nucleic acid structure and epigenetic modifications 
to DNA, and within these it still has important applications. 



Chain-termination methods 

Because the chain-terminator method (or Sanger method 
after its developer Frederick Sanger) is more efficient 
and uses fewer toxic chemicals and lower amounts of 
radioactivity than the method of Maxam and Gilbert, it 
rapidly became the method of choice. The key principle 
of the Sanger method was the use of dideoxynucleotide 
triphosphates (ddNTPs) as DNA chain terminators. 

The classical chain-termination method requires a 
single-stranded DNA template, a DNA primer, a DNA 
polymerase, radioactively or fluorescently labeled 
nucleotides, and modified nucleotides that terminate 
DNA strand elongation. The DNA sample is divided into 
four separate sequencing reactions, containing all four of 
the standard deoxynucleotides (dATP, dGTP, dCTP and 
dTTP) and the DNA polymerase. To each reaction is 
added only one of the four dideoxynucleotides (ddATP, 
ddGTP, ddCTP, or ddTTP) which are the 
chain-terminating nucleotides, lacking a 3'-OH group 
required for the formation of a phosphodiester bond 
between two nucleotides, thus terminating DNA strand 
extension and resulting in various DNA fragments of 
varying length. 




Part of a radioactively labelled 
sequencing gel 



The newly synthesized and labeled DNA fragments are 
heat denatured, and separated by size (with a resolution 

of just one nucleotide) by gel electrophoresis on a denaturing polyacrylamide-urea gel with 
each of the four reactions run in one of four individual lanes (lanes A, T, G, C); the DNA 
bands are then visualized by autoradiography or UV light, and the DNA sequence can be 
directly read off the X-ray film or gel image. In the image on the right, X-ray film was 
exposed to the gel, and the dark bands correspond to DNA fragments of different lengths. A 
dark band in a lane indicates a DNA fragment that is the result of chain termination after 

incorporation of a dideoxynucleotide (ddATP, ddGTP, ddCTP, or ddTTP). The relative 
positions of the different bands among the four lanes are then used to read (from bottom to 
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top) the DNA sequence. 
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DNA fragments are labeled with a 

radioactive or fluorescent tag on the 

primer (1), in the new DNA strand with 

a labeled dNTP, or with a labeled 

ddNTP. (click to expand) 



Technical variations of chain-termination sequencing 
include tagging with nucleotides containing radioactive 
phosphorus for radiolabelling, or using a primer labeled 
at the 5' end with a fluorescent dye. Dye-primer 
sequencing facilitates reading in an optical system for 
faster and more economical analysis and automation. 



The later development by Leroy Hood and coworkers 



[9] 



L J of fluorescently labeled ddNTPs and primers set the 
stage for automated, high-throughput DNA sequencing. 



Chain-termination methods have greatly simplified DNA 
sequencing. For example, chain-termination-based kits 
are commercially available that contain the reagents 
needed for sequencing, pre-aliquoted and ready to use. 
Limitations include non-specific binding of the primer 
to the DNA, affecting accurate read-out of the DNA 
sequence, and DNA secondary structures affecting the 
fidelity of the sequence. 






G C A T ~ 

Sequence ladder by radioactive 

sequencing compared to fluorescent 

peaks (click to expand) 



DNA sequencing 



61 



Dye-terminator sequencing 



Migration of DNA 
fragments 
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Capillary electrophoresis (click to 
expand) 



Dye-terminator sequencing utilizes labelling of the 
chain terminator ddNTPs, which permits sequencing in 
a single reaction, rather than four reactions as in the 
labelled-primer method. In dye-terminator sequencing, 
each of the four dideoxynucleotide chain terminators is 
labelled with fluorescent dyes, each of which with 
different wavelengths of fluorescence and emission. 
Owing to its greater expediency and speed, 
dye-terminator sequencing is now the mainstay in 
automated sequencing. Its limitations include dye 
effects due to differences in the incorporation of the dye-labelled chain terminators into the 
DNA fragment, resulting in unequal peak heights and shapes in the electronic DNA 
sequence trace chromatogram after capillary electrophoresis (see figure to the right). This 
problem has been addressed with the use of modified DNA polymerase enzyme systems and 
dyes that minimize incorporation variability, as well as methods for eliminating "dye blobs". 
The dye-terminator sequencing method, along with automated high-throughput DNA 
sequence analyzers, is now being used for the vast majority of sequencing projects. 

Challenges 

Common challenges of DNA sequencing include poor quality in the first 15-40 bases of the 
sequence and deteriorating quality of sequencing traces after 700-900 bases. Base calling 
software typically gives an estimate of quality to aid in quality trimming. 

In cases where DNA fragments are cloned before sequencing, the resulting sequence may 
contain parts of the cloning vector. In contrast, PCR-based cloning and emerging 
sequencing technologies based on pyrosequencing often avoid using cloning vectors. 




View of the start of an example 
dye-terminator read (click to expand) 



Automation and sample preparation 

Automated DNA-sequencing instruments (DNA 

sequencers) can sequence up to 384 DNA samples in a 

single batch (run) in up to 24 runs a day. DNA 

sequencers carry out capillary electrophoresis for size 

separation, detection and recording of dye 

fluorescence, and data output as fluorescent peak trace 

chromatograms. Sequencing reactions by thermocycling, cleanup and re-suspension in a 

buffer solution before loading onto the sequencer are performed separately. A number of 

commercial and non-commercial software packages can trim low-quality DNA traces 

automatically. These programs score the quality of each peak and remove low-quality base 

peaks (generally located at the ends of the sequence). The accuracy of such algorithms is 

below visual examination by a human operator, but sufficient for automated processing of 

large sequence data sets. 
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DNA extraction 



Large-scale sequencing strategies 

Current methods can directly sequence only relatively short (300-1000 nucleotides long) 
DNA fragments in a single reaction. 1 J The main obstacle to sequencing DNA fragments 
above this size limit is insufficient power of separation for resolving large DNA fragments 
that differ in length by only one nucleotide. 

Large-scale sequencing aims at sequencing very long 
DNA pieces, such as whole chromosomes. Common 
approaches consist of cutting (with restriction enzymes) 
or shearing (with mechanical forces) large DNA 
fragments into shorter DNA fragments. The fragmented 
DNA is cloned into a DNA vector, and amplified in 
Escherichia coli. Short DNA fragments purified from 
individual bacterial colonies are individually sequenced 
and assembled electronically into one long, contiguous 
sequence. This method does not require any 
pre-existing information about the sequence of the DNA 
and is referred to as de novo sequencing. Gaps in the 
assembled sequence may be filled by primer walking. 
The different strategies have different tradeoffs in 
speed and accuracy; shotgun methods are often used 
for sequencing large genomes, but its assembly is 
complex and difficult, particularly with sequence 
repeats often causing gaps in genome assembly. 
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Genomic DNA is fragmented into 

random pieces and cloned as a 

bacterial library. DNA from individual 

bacterial clones is sequenced and the 

sequence is assembled by using 

overlapping DNA regions. (click to 

expand) 



New sequencing methods 



High-throughput sequencing 

The high demand for low-cost sequencing has driven the development of high-throughput 
sequencing technologies that parallelize the sequencing process, producing thousands or 
millions of sequences at once. 12] [13] High-throughput sequencing technologies are 
intended to lower the cost of DNA sequencing beyond what is possible with standard 
dye-terminator methods. 

In vitro clonal amplification 

Molecular detection methods are not sensitive enough for single molecule sequencing, so 
most approaches use an in vitro cloning step to amplify individual DNA molecules. Emulsion 
PCR isolates individual DNA molecules along with primer-coated beads in aqueous droplets 
within an oil phase. Polymerase chain reaction (PCR) then coats each bead with clonal 
copies of the DNA molecule followed by immobilization for later sequencing. Emulsion PCR 
is used in the methods by Marguilis et al. (commercialized by 454 Life Sciences), Shendure 
and Porreca et al. (also known as "polony sequencing") and SOLiD sequencing, (developed 
by Agencourt, now Applied Biosystems). J L J L J Another method for in vitro clonal 
amplification is bridge PCR, where fragments are amplified upon primers attached to a 
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solid surface. The single-molecule method developed by Stephen Quake's laboratory (later 
commercialized by Helicos) skips this amplification step, directly fixing DNA molecules to a 
surface. 17 - 1 

Parallelized sequencing 

DNA molecules are physically bound to a surface, and sequenced in parallel. Sequencing by 
synthesis, like dye-termination electrophoretic sequencing, uses a DNA polymerase to 
determine the base sequence. Reversible terminator methods (used by Illumina and 
Helicos) use reversible versions of dye-terminators, adding one nucleotide at a time, detect 
fluorescence at each position in real time, by repeated removal of the blocking group to 
allow polymerization of another nucleotide. Pyrosequencing (used by 454) also uses DNA 
polymerization, adding one nucleotide species at a time and detecting and quantifying the 
number of nucleotides added to a given location through the light emitted by the release of 
attached pyrophosphates. 14] [18] 

Sequencing by ligation 

This enzymatic sequencing method uses a DNA ligase to determine the target sequence. 15] 
•- - 1 •- J Used in the polony method and in the SOLiD technology, it uses a pool of all 
possible oligonucleotides of a fixed length, labeled according to the sequenced position. 
Oligonucleotides are annealed and ligated; the preferential ligation by DNA ligase for 
matching sequences results in a signal informative of the nucleotide at that position. 

Microfluidic Sanger Sequencing 

In microfluidic Sanger sequencing the entire thermocycling amplification of DNA fragments 
as well as their separation by electrophoresis is done on a single chip (approximately 
100 cm in diameter) thus reducing the reagent usage as well as cost. In some instances 
researchers have shown that they can increase the through-put of conventional sequencing 
through the use of microchips. Research will still need to be done in order to make this use 
of technology effective. 

Other sequencing technologies 

Sequencing by hybridization is a non-enzymatic method that uses a DNA microarray. A 
single pool of DNA whose sequence is to be determined is fluorescently labeled and 
hybridized to an array containing known sequences. Strong hybridization signals from a 
given spot on the array identifies its sequence in the DNA being sequenced. J Mass 
spectrometry may be used to determine mass differences between DNA fragments 
produced in chain-termination reactions. 1 J 

DNA sequencing methods currently under development include labeling the DNA 
polymerase, ] reading the sequence as a DNA strand transits through nanopores, ] [ ] 
and microscopy-based techniques, such as AFM or electron microscopy that are used to 
identify the positions of individual nucleotides within long DNA fragments (>5,000 bp) by 
nucleotide labeling with heavier elements (e.g., halogens) for visual detection and 
recording. 5] 

In October 2006, the X Prize Foundation established an initiative to promote the 
development of full genome sequencing technologies, called the Archon X Prize, intending 
to award $10 million to "the first Team that can build a device and use it to sequence 100 
human genomes within 10 days or less, with an accuracy of no more than one error in every 
100,000 bases sequenced, with sequences accurately covering at least 98% of the genome, 
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and at a recurring cost of no more than $10,000 (US) per genome. " [26] 

Major landmarks in DNA sequencing 

• 1953 Discovery of the structure of the DNA double helix. 

• 1972 Development of recombinant DNA technology, which permits isolation of defined 
fragments of DNA; prior to this, the only accessible samples for sequencing were from 
bacteriophage or virus DNA. 

• 1975 The first complete DNA genome to be sequenced is that of bacteriophage cpX174 

• 1977 Allan Maxam and Walter Gilbert publish "DNA sequencing by chemical 
degradation". ] Frederick Sanger, independently, publishes "DNA sequencing by 
enzymatic synthesis". 

• 1980 Frederick Sanger and Walter Gilbert receive the Nobel Prize in Chemistry 

• 1984 Medical Research Council scientists decipher the complete DNA sequence of the 
Epstein-Barr virus, 170 kb. 

• 1986 Leroy E. Hood's laboratory at the California Institute of Technology and Smith 
announce the first semi-automated DNA sequencing machine. 

• 1987 Applied Biosystems markets first automated sequencing machine, the model ABI 
370. 

• 1990 The U.S. National Institutes of Health (NIH) begins large-scale sequencing trials on 
Mycoplasma capricolum, Escherichia coli, Caenorhabditis elegans, and Saccharomyces 
cerevisiae (at 75 cents (US)/base). 

• 1995 Richard Mathies et al. publish dye-based sequencing. ] 

• 1998 Phil Green and Brent Ewing of the University of Washington publish "phred" for 
sequencer data analysis [ ] . 

See also 
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Full Genome Sequencing 

Genome project 

Single Molecule Real Time Sequencing 

Applied Biosystems 
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Illumina (company) 

Pacific Biosciences 

Complete Genomics 

Joint Genome Institute 

DNA field-effect transistor 

DNA sequencing theory 
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Paracrystal model and theory 

Paracrystalline materials are defined as having short and medium range ordering in their 
lattice (similar to the liquid crystal phases) but lacking long-range ordering at least in one 
direction. J 

Ordering is the regularity in which atoms appear in a predictable lattice, as measured from 
one point. In a highly ordered, perfectly crystalline material, or single crystal, the location 
of every atom in the structure can be described exactly measuring out from a single origin. 
Conversely, in a disordered structure such as a liquid or amorphous solid, the location of 
the first and perhaps second nearest neighbors can be described from an origin (with some 
degree of uncertainty) and the ability to predict locations decreases rapidly from there out. 
The distance at which atom locations can be predicted is referred to as the correlation 
length £ . A paracrystalline material exhibits correlation somewhere between the fully 
amorphous and fully crystalline. 

The primary, most accessible source of crystallinity information is X-ray diffraction, 
although other techniques may be needed to observe the complex structure of 
paracrystalline materials, such as fluctuation electron microscopy L J in combination with 
Density of states modeling [3] of electronic and vibrational states. 

Paracrystalline Model 

The paracrystalline model is a revision of the Continuous Random Network model first 
proposed by W. H. Zachariasen in 1932 L J . The paracrystal model is defined as highly 
strained, microcrystalline grains surrounded by fully amorphous material [5] . This is a 
higher energy state then the continuous random network model. The important distinction 
between this model and the microcrystalline phases is the lack of defined grain boundaries 
and highly strained lattice parameters, which makes calculations of molecular and lattice 
dynamics difficult. A general theory of paracrystals has been formulated in a basic 
textbook^ J , and then further developed/refined by various authors. 

Applications 

The paracrystal model has been useful, for example, in describing the state of partially 
amorphous semiconductor materials after deposition. It has also been successfully applied 
to: synthetic polymers, liquid crystals, biopoloymers , and biomembranes L J . 

See also 

• X-ray scattering 

• Amorphous solid 

• Single Crystal 

• Polycrystalline 

• Crystallography 

• DNA 

• X-ray pattern of a B-DNA Paracrystal [10] 
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1. REDIRECT X-ray scattering techniques 

Crystallography 



Crystallography is the experimental science of determining the arrangement of atoms in 
solids. In older usage, it is the scientific study of crystals. The word "crystallography" is 
derived from the Greek words crystallon = cold drop / frozen drop, with its meaning 
extending to all solids with some degree of transparency, and graphein = write. 

Before the development of X-ray diffraction crystallography (see below), the study of 
crystals was based on the geometry of the crystals. This involves measuring the angles of 
crystal faces relative to theoretical reference axes (crystallographic axes), and establishing 
the symmetry of the crystal in question. The former is carried out using a goniometer. The 
position in 3D space of each crystal face is plotted on a stereographic net, e.g. Wulff net or 
Lambert net. In fact, the pole to each face is plotted on the net. Each point is labelled with 
its Miller index. The final plot allows the symmetry of the crystal to be established. 

Crystallographic methods now depend on the analysis of the diffraction patterns that 
emerge from a sample that is targeted by a beam of some type. The beam is not always 
electromagnetic radiation, even though X-rays are the most common choice. For some 
purposes electrons or neutrons are used, which is possible due to the wave properties of 
the particles. Crystallographers often explicitly state the type of illumination used when 
referring to a method, as with the terms X-ray diffraction, neutron diffraction and 
electron diffraction. 

These three types of radiation interact with the specimen in different ways. X-rays interact 
with the spatial distribution of the valence electrons, while electrons are charged particles 
and therefore feel the total charge distribution of both the atomic nuclei and the 
surrounding electrons. Neutrons are scattered by the atomic nuclei through the strong 
nuclear forces, but in addition, the magnetic moment of neutrons is non-zero. They are 
therefore also scattered by magnetic fields. When neutrons are scattered from 
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hydrogen-containing materials, they produce diffraction patterns with high noise levels. 
However, the material can sometimes be treated to substitute hydrogen for deuterium. 
Because of these different forms of interaction, the three types of radiation are suitable for 
different crystallographic studies. 

Theory 

An image of a small object is usually generated by using a lens to focus the illuminating 
radiation, as is done with the rays of the visible spectrum in light microscopy. However, the 
wavelength of visible light (about 4000 to 7000 Angstroms) is three orders of magnitude 
longer then the length of typical atomic bonds and atoms themselves (about 1 to 2 
Angstroms). Therefore, obtaining information about the spatial arrangement of atoms 
requires the use of radiation with shorter wavelengths, such as X-rays. Employing shorter 
wavelengths implied abandoning microscopy and true imaging, however, because there 
exists no material from which a lens capable of focusing this type of radiation can be 
created. (That said, scientists have had some success focusing X-rays with microscopic 
Fresnel zone plates made from gold, and by critical-angle reflection inside long tapered 
capillaries[l]). Diffracted x-ray beams cannot be focused to produce images, so the sample 
structure must be reconstructed from the diffraction pattern. Sharp features in the 
diffraction pattern arise from periodic, repeating structure in the sample, which are often 
very strong due to coherent reflection of many photons from many regularly spaced 
instances of similar structure, while non-periodic components of the structure result in 
diffuse (and usually weak) diffraction features. 

Because of their highly ordered and repetitive structure, crystals give diffraction patterns 
of sharp Bragg reflection spots, and are ideal for analyzing the structure of solids. 

Notation 

See Miller index for a full treatment of this topic. 

• Coordinates in square brackets such as [100] denote a direction vector (in real space). 

• Coordinates in angle brackets or chevrons such as <100> denote a family of directions 
which are related by symmetry operations. In the cubic crystal system for example, 
<100> would mean [100], [010], [001] or the negative of any of those directions. 

• Miller indices in parentheses such as (100) denote a plane of the crystal structure, and 
regular repetitions of that plane with a particular spacing. In the cubic system, the 
normal to the (hkl) plane is the direction [hkl], but in lower-symmetry cases, the normal 
to (hkl) is not parallel to [hkl]. 

• Indices in curly brackets or braces such as {100} denote a family of planes and their 
normals which are equivalent in cubic materials due to symmetry operations, much the 
way angle brackets denote a family of directions. In non-cubic materials, <hkl> is not 
necessarily perpendicular to {hkl}. 
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Technique 

Some materials studied using crystallography, proteins for example, do not occur naturally 
as crystals. Typically, such molecules are placed in solution and allowed to crystallize over 
days, weeks, or months through vapor diffusion. A drop of solution containing the molecule, 
buffer, and precipitants is sealed in a container with a reservoir containing a hygroscopic 
solution. Water in the drop diffuses to the reservoir, slowly increasing the concentration 
and allowing a crystal to form. If the concentration were to rise more quickly, the molecule 
would simply precipitate out of solution, resulting in disorderly granules rather than an 
orderly and hence usable crystal. 

Once a crystal is obtained, data can be collected using a beam of radiation. Although many 
universities that engage in crystallographic research have their own X-ray producing 
equipment, synchrotrons are often used as X-ray sources, because of the purer and more 
complete patterns such sources can generate. Synchrotron sources also have a much 
higher intensity of X-ray beams, so data collection takes a fraction of the time normally 
necessary at weaker sources. 

Producing an image from a diffraction pattern requires sophisticated mathematics and 
often an iterative process of modelling and refinement. In this process, the 
mathematically predicted diffraction patterns of an hypothesized or "model" structure are 
compared to the actual pattern generated by the crystalline sample. Ideally, researchers 
make several initial guesses, which through refinement all converge on the same answer. 
Models are refined until their predicted patterns match to as great a degree as can be 
achieved without radical revision of the model. This is a painstaking process, made much 
easier today by computers. 

The mathematical methods for the analysis of diffraction data only apply to patterns, which 
in turn result only when waves diffract from orderly arrays. Hence crystallography applies 
for the most part only to crystals, or to molecules which can be coaxed to crystallize for the 
sake of measurement. In spite of this, a certain amount of molecular information can be 
deduced from the patterns that are generated by fibers and powders, which while not as 
perfect as a solid crystal, may exhibit a degree of order. This level of order can be sufficient 
to deduce the structure of simple molecules, or to determine the coarse features of more 
complicated molecules (the double-helical structure of DNA, for example, was deduced 
from an X-ray diffraction pattern that had been generated by a fibrous sample). 

Crystallography in materials engineering 

Crystallography is a tool that is often employed by materials scientists. In single crystals, 
the effects of the crystalline arrangement of atoms is often easy to see macroscopically, 
because the natural shapes of crystals reflect the atomic structure. In addition, physical 
properties are often controlled by crystalline defects. The understanding of crystal 
structures is an important prerequisite for understanding crystallographic defects. Mostly, 
materials do not occur in a single crystalline, but poly-crystalline form, such that the 
powder diffraction method plays a most important role in structural determination. 

A number of other physical properties are linked to crystallography. For example, the 
minerals in clay form small, flat, platelike structures. Clay can be easily deformed because 
the platelike particles can slip along each other in the plane of the plates, yet remain 
strongly connected in the direction perpendicular to the plates. Such mechanisms can be 
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studied by crystallographic texture measurements. 

In another example, iron transforms from a body-centered cubic (bcc) structure to a 
face-centered cubic (fee) structure called austenite when it is heated. The fee structure is a 
close-packed structure, and the bcc structure is not, which explains why the volume of the 
iron decreases when this transformation occurs. 

Crystallography is useful in phase identification. When performing any process on a 
material, it may be desired to find out what compounds and what phases are present in the 
material. Each phase has a characteristic arrangement of atoms. Techniques like X-ray 
diffraction can be used to identify which patterns are present in the material, and thus 
which compounds are present (note: the determination of the "phases" within a material 
should not be confused with the more general problem of "phase determination," which 
refers to the phase of waves as they diffract from planes within a crystal, and which is a 
necessary step in the interpretation of complicated diffraction patterns). 

Crystallography covers the enumeration of the symmetry patterns which can be formed by 
atoms in a crystal and for this reason has a relation to group theory and geometry. See 
symmetry group. 

Biology 

X-ray crystallography is the primary method for determining the molecular conformations 
of biological macromolecules, particularly protein and nucleic acids such as DNA and RNA. 
In fact, the double-helical structure of DNA was deduced from crystallographic data. The 
first crystal structure of a macromolecule was solved in 1958 (Kendrew, J.C. et al. (1958) A 
three-dimensional model of the myoglobin molecule obtained by X-ray analysis (Nature 181, 
662-666). The Protein Data Bank (PDB) is a freely accessible repository for the structures 
of proteins and other biological macromolecules. Computer programs like RasMol or Pymol 
can be used to visualize biological molecular structures. 

Electron crystallography has been used to determine some protein structures, most notably 
membrane proteins and viral capsids. 
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Further reading 

• Burns, G.; Glazer, A.M. (1990). Space Groups for Scientists and Engineers (2nd ed.). 
Boston: Academic Press, Inc. ISBN 0-12-145761-3. 
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Oxford University Press. ISBN 0-19-855-901-1. 

• Drenth, J (1999). Principles of Protein X-Ray Crystallography. New York: Springer-Verlag. 
ISBN 0-387-98587-5. 
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• Glusker, JP; Lewis M, Rossi M (1994). Crystal Structure Analysis for Chemists and 
Biologists. New York: VCH Publishers. ISBN 0-471-18543-4. 

• O'Keeffe, M. ; Hyde, B.G. (1996). Crystal Structures; I. Patterns and Symmetry. 
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Applied Computational Powder Diffraction Data Analysis 

• Young, R.A., ed (1993). The Rietveld Method. Oxford: Oxford University Press & 
International Union of Crystallography. ISBN 0-19-855577-6. 

External links 

Introduction to Crystallography and Mineral Crystal Systems L J 

Crystallographic Teaching Pamphlets [3] 

Crystal Lattice Structures [ ] 

Freely Available Crystallographic Software for Academia [5] 

NetSci Software Listing for Crystallography [ ] 

SINCRIS Information Server for Crystallography [7] 

ORTEP a professional grade viewer for use on a PC which is based on the FORTRAN code 

which came from Oak Ridge [8] 

Vega Science Trust Interviews on Crystallography L J Freeview video interviews with Max 

Pertuz, Rober Huber and Aaron Klug. 

Commission on Crystallographic Teaching, Pamphlets L J 

Crystallography site of Steffen Weber with lots of Java Applets [11] 

IUCr Online Dictionary of Crystallography L J 

American Crystallographic Association [13] 

Laue Measurement of Single-Crystal Turbine Blades L J 

Ames Laboratory, US DOE Crystallography Research Resources [15] 



Crystallography 



74 



References 

[ 1 ] http ://scripts . iucr . org/cgi-bin/paper?kv5 3 7 

[2 ] http ://www. rockhounds . com/rockshop/xtal/index. html 

[ 3 ] http ://www. mineralogie . uni-wuerzburg . de/links/teach/cry steach. html 

[4] http ://cst-www. nrl. navy . mil/lattice/spcgrp/ 

[5] http://www.ccpl4.ac.uk/ 

[6] http://www.netsci.org/Resources/Software/Struct/xray.html 

[ 7 ] http ://www. iucr. org/sincris-top/ 

[8] http://www.chem.gla.ac.uk/~louis/software/ortep3/download.html 

[9] http://www.vega.org.uk 

[10] http ://www. iucr. org/iucr-top/comm/cteach/pamphlets . html 

[11] http://www.jcrystal.com/steffenweber/ 

[12] http ://reference. iucr. org/dictionary/Main_Page 

[13] http://aca.hwi.buffalo.edu/ 

[14] http://www.geocities.eom/psistar@sbcglobal.net/Laue.pdf 

[15] http ://www. mcbmm. ameslab . gov/index. html 



X-ray crystallography 



X-ray crystallography is a 

method of determining the 
arrangement of atoms within a 
crystal, in which a beam of 
X-rays strikes a crystal and 
scatters into many different 
directions. From the angles 
and intensities of these 
scattered beams, a 

crystallographer can produce 
a three-dimensional picture of 
the density of electrons within 
the crystal. From this electron 
density, the mean positions of 
the atoms in the crystal can be 
determined, as well as their 
chemical bonds, their disorder 
and various other information. 

Since many materials can form 

crystals — such as salts, 

metals, minerals, 

semiconductors, as well as 

various inorganic, organic and 

biological molecules — X-ray 

crystallography has been 

fundamental in the development of many scientific fields. In its first decades of use, this 

method determined the size of atoms, the lengths and types of chemical bonds, and the 

atomic-scale differences among various materials, especially minerals and alloys. The 
method also revealed the structure and functioning of many biological molecules, including 




X-ray crystallography can locate every atom in a zeolite, an 

aluminosilicate with many important applications, such as water 

purification. 
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vitamins, drugs, proteins and nucleic acids such as DNA. X-ray crystallography is still the 
chief method for characterizing the atomic structure of new materials and in discerning 
materials that appear similar by other experiments. X-ray crystal structures can also 
account for unusual electronic or elastic properties of a material, shed light on chemical 
interactions and processes, or serve as the basis for designing pharmaceuticals against 
diseases. 

After a crystal has been obtained or grown in the laboratory, it is mounted on a goniometer 
and gradually rotated while being bombarded with X-rays, producing a diffraction pattern 
of regularly spaced spots known as reflections. The two-dimensional images taken at 
different rotations are converted into a three-dimensional model of the density of electrons 
within the crystal using the mathematical method of Fourier transforms, combined with 
chemical data known for the sample. Poor resolution (fuzziness) or even errors may result if 
the crystals are too small, or not uniform enough in their internal makeup. 

X-ray crystallography is related to several other methods for determining atomic structures. 
Similar diffraction patterns can be produced by scattering electrons or neutrons, which are 
likewise interpreted as a Fourier transform. If single crystals of sufficient size cannot be 
obtained, various X-ray scattering methods can be applied to obtain less detailed 
information; such methods include fiber diffraction, powder diffraction and small-angle 
X-ray scattering (SAXS). In all these methods, the scattering is elastic; the scattered X-rays 
have the same wavelength as the incoming X-ray. By contrast, inelastic X-ray scattering 
methods are useful in studying excitations of the sample, rather than the distribution of its 
atoms. 



History 



Early scientific history of crystals and X-rays 

Crystals have long been admired for their regularity 
and symmetry, but they were not investigated 
scientifically until the 17th century. Johannes Kepler 
hypothesized in his work Strena seu de Nive Sexangula 
(1611) that the hexagonal symmetry of snowflake 
crystals was due to a regular packing of spherical water 
particles. ] 
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Drawing of square (Figure A, above) 

and hexagonal (Figure B, below) 

packing from Kepler's work, Strena 

seu de Nive Sexangula. 
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As shown by X-ray crystallography, the 

hexagonal symmetry of snowflakes 

results from the tetrahedral 

arrangement of hydrogen bonds about 

each water molecule. The water 

molecules are arranged simiarly to the 

silicon atoms in the tridymite 

polymorph of SiO . The resulting 

crystal structure has hexagonal 

symmetry when viewed along a 

principal axis. 

X-ray crystallography/ 7 - 1 however, 
models as conclusive. 



Crystal symmetry was first investigated experimentally 
by Nicolas Steno (1669), who showed that the angles 
between the faces are the same in every exemplar of a 
particular type of crystal, ] and by Rene Just Haiiy 
(1784), who discovered that every face of a crystal can 
be described by simple stacking patterns of blocks of 
the same shape and size. Hence, William Hallowes 
Miller in 1839 was able to give each face a unique label 
of three small integers, the Miller indices which are still 
used today for identifying crystal faces. Haiiy's study 
led to the correct idea that crystals are a regular 
three-dimensional array (a Bravais lattice) of atoms and 
molecules; a single unit cell is repeated indefinitely 
along three principal directions that are not necessarily 
perpendicular. In the 19th century, a complete catalog 
of the possible symmetries of a crystal was worked out 
by Johann Hessel, J Auguste Bravais, J Yevgraf 
Fyodorov, ] , Arthur Schonflies [6] and (belatedly) 
William Barlow. On the basis of the available data and 
physical reasoning, Barlow proposed several crystal 
structures in the 1880s that were validated later by 
the available data were too few in the 1880s to accept his 



X-rays were discovered by Wilhelm Conrad Rontgen in 
1895, just as the studies of crystal symmetry were 
being concluded. Physicists were initially uncertain of 
the nature of X-rays, although it was soon suspected 
(correctly) that they were waves of electromagnetic 
radiation, in other words, another form of light. At that 
time, the wave model of light — specifically, the 
Maxwell theory of electromagnetic radiation — was well 
accepted among scientists, and experiments by Charles 
Glover Barkla showed that X-rays exhibited phenomena 
associated with electromagnetic waves, including 
transverse polarization and spectral lines akin to those 
observed in the visible wavelengths. Single-slit 
experiments in the laboratory of Arnold Sommerfeld 
suggested the wavelength of X-rays was roughly 1 
Angstrom, one ten millionth of a millimetre. However, 
X-rays are composed of photons, and thus are not only 



Hydrogen 
bonds 




X-ray crystallography shows the 

arrangement of water molecules in ice, 

revealing the hydrogen bonds that hold 

the solid together. Few other methods 

can determine the structure of matter 

with such sub-atomic precision 

(resolution). 



[8] 



waves of electromagnetic radiation but also exhibit 

particle-like properties. The photon concept was introduced by Albert Einstein in 1905, 
but it was not broadly accepted until 1922, ] [ ] when Arthur Compton confirmed it by the 
scattering of X-rays from electrons. 1 J Therefore, these particle-like properties of X-rays, 



such as their ionization of gases, caused William Henry Bragg to argue in 1907 that X-rays 
were not electromagnetic radiation. ] Nevertheless, Bragg's view was not broadly 
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accepted and the observation of X-ray diffraction in 1912 [13] confirmed for most scientists 
that X-rays were a form of electromagnetic radiation. 



X-ray analysis of crystals 
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The incoming beam (coming from 

upper left) causes each scatterer to 

re-radiate a small portion of its energy 

as a spherical wave. If scatterers are 

arranged symmetrically with a 
separation d, these spherical waves 
will be in synch (add constructively) 

only in directions where their 

path-length difference 2d sin 9 equals 

an integer multiple of the wavelength 

A. In that case, part of the incoming 

beam is deflected by an angle 29, 

producing a reflection spot in the 

diffraction pattern. 



Crystals are regular arrays of atoms, and X-rays can be 
considered waves of electromagnetic radiation. Atoms 
scatter X-ray waves, primarily through the atoms' 
electrons. Just as an ocean wave striking a lighthouse 
produces secondary circular waves emanating from the 
lighthouse, so an X-ray striking an electron produces 
secondary spherical waves emanating from the 
electron. This phenomenon is known as elastic 
scattering, and the electron (or lighthouse) is known as 
the scatterer. A regular array of scatterers produces a 
regular array of spherical waves. Although these waves 
cancel one another out in most directions through 
destructive interference, they add constructively in a 
few specific directions, determined by Bragg's law: 



2d sin 9 = nX 

where d is the spacing between diffracting planes, 9 is the incident angle, n is any integer, 
and A is the wavelength of the beam. These specific directions appear as spots on the 
diffraction pattern, often called reflections. Thus, X-ray diffraction results from an 
electromagnetic wave (the X-ray) impinging on a regular array of scatterers (the repeating 
arrangement of atoms within the crystal). 

X-rays are used to produce the diffraction pattern because their wavelength A is typically 
the same order of magnitude (1-100 Angstroms) as the spacing d between planes in the 
crystal. In principle, any wave impinging on a regular array of scatterers produces 
diffraction, as predicted first by Francesco Maria Grimaldi in 1665. To produce significant 
diffraction, the spacing between the scatterers and the wavelength of the impinging wave 
should be roughly similar in size. For illustration, the diffraction of sunlight through a bird's 
feather was first reported by James Gregory in the later 17th century. The first man-made 
diffraction gratings for visible light were constructed by David Rittenhouse in 1787, and 
Joseph von Fraunhofer in 1821. However, visible light has too long a wavelength (typically, 
5500 Angstroms) to observe diffraction from crystals. However, prior to the first X-ray 
diffraction experiments, the spacings between lattice planes in a crystal were not known 
with certainty. 

The idea that crystals could be used as a diffraction grating for X-rays arose in 1912 in a 
conversation between Paul Peter Ewald and Max von Laue in the English Garden in 
Munich. Ewald had proposed a resonator model of crystals for his thesis, but this model 
could not be validated using visible light, since the wavelength was much larger than the 
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spacing between the resonators. Von Laue realized that electromagnetic radiation of a 

shorter wavelength was needed to observe such small spacings, and suggested that X-rays 

might have a wavelength comparable to the unit-cell spacing in crystals. Von Laue worked 

with two technicians, Walter Friedrich and his assistant Paul Knipping, to shine a beam of 

X-rays through a copper sulphate crystal and record its diffraction on a photographic plate. 

After being developed, the plate showed a large number of well-defined spots arranged in a 

pattern of intersecting circles around the spot produced by the central beam. 13] [14] Von 

Laue developed a law that connects the scattering angles and the size and orientation of 

the unit-cell spacings in the crystal, for which he was awarded the Nobel Prize in Physics in 
1914.H5] 

As described in the mathematical derivation below, the X-ray scattering is determined by 
the density of electrons within the crystal. Since the energy of an X-ray is much greater 
than that of an atomic electron, the scattering may be modeled as Thomson scattering, the 
interaction of an electromagnetic ray with a free electron. This model is generally adopted 
to describe the polarization of the scattered radiation. The intensity of Thomson scattering 
declines as 1/m 2 with the mass m of the charged particle that is scattering the radiation; 
hence, the atomic nuclei, which are thousands of times heavier than an electron, contribute 
negligibly to the scattered X-rays. 



Development from 1912 to 1920 

After Von Laue's pioneering research, the field 
developed rapidly, most notably by physicists William 
Lawrence Bragg and his father William Henry Bragg. In 
1912-1913, the younger Bragg developed Bragg's law, 
which connects the observed scattering with reflections 
from evenly spaced planes within the crystal. J The 
earliest structures were generally simple and marked 
by one-dimensional symmetry. However, as 
computational and experimental methods improved 
over the next decades, it became feasible to deduce 
reliable atomic positions for more complicated two- and 
three-dimensional arrangements of atoms in the 
unit-cell. 

The potential of X-ray crystallography for determining 
the structure of molecules and minerals — then only 
known vaguely from chemical and hydrodynamic 
experiments — was realized immediately. The earliest 
structures were simple inorganic crystals and minerals, 
but even these revealed fundamental laws of physics 
and chemistry. The first atomic-resolution structure to 
be solved (in 1914) was that of table salt. [17] (When an 
atomic structure is determined by X-ray 
crystallography, it is said to be "solved".) The 
distribution of electrons in the table-salt structure 




Although diamonds (top left) and 

graphite (top right) are identical in 

chemical composition — being both 

pure carbon — X-ray crystallography 

revealed the arrangement of their 

atoms (bottom), which accounts for 

their different properties. In diamond, 

the carbon atoms are arranged 

tetrahedrally and held together by 

single covalent bonds, making it strong 

in all directions. By contrast, graphite 

is composed of stacked sheets, in 

which the carbon atoms are bonded 

hexagonally by delocalized bonds of 

average order 4/3; there are no 

covalent bonds between sheets, 

making graphite easy to cleave into 

flakes. 
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showed that crystals are not necessarily comprised of covalently bonded molecules, and 
proved the existence of ionic compounds. J The structure of diamond was solved in the 
same year, 9] proving the tetrahedral arrangement of its chemical bonds and showing that 
the C-C single bond was 1.52 Angstroms. Other early structures included copper, ] 
calcium fluoride (CaF also known as fluorite), calcite (CaCO ) and pyrite (FeS J in 

T221 

1914; spinel (MgAl O ) in 1915; L J the rutile and anatase forms of titanium dioxide (TiO ) 

roQ] z 4 Z 

in 1916; L J pyrochroite and, by extension, brucite [Mn(OH) and Mg(OH) respectively] in 
1919; [24] and wurtzite (hexagonal ZnS) in 1920. [25] 

The structure of graphite was solved in 1916 L J by the related method of powder 
diffraction, 7] which was developed by Peter Debye and Paul Scherrer and, independently, 
by Albert Hull in 1917. J The structure of graphite was determined from single-crystal 
diffraction in 1924 by two groups independently. 9] [30] Hull also used the powder method 
to determine the structures of various metals, such as iron L J and magnesium. J 

Contributions to chemistry and material science 

X-ray crystallography has led to a better understanding of chemical bonds and non-covalent 
interactions. The initial studies revealed the typical radii of atoms, and confirmed many 
theoretical models of chemical bonding, such as the tetrahedral bonding of carbon in the 
diamond structure, ] the octahedral bonding of metals observed in ammonium 
hexachloroplatinate (IV), J and the resonance observed in the planar carbonate group L J 
and in aromatic molecules. ] [ 5] Kathleen Lonsdale's 1928 structure of 
hexamethylbenzene L J established the hexagonal symmetry of benzene and showed a clear 
difference in bond length between the aliphatic C-C bonds and aromatic C-C bonds; this 
finding led to the idea of resonance between chemical bonds, which had profound 
consequences for the development of chemistry. ] Her conclusions were anticipated by 
William Henry Bragg, who published models of naphthalene and anthracene in 1921 based 
on other molecules, an early form of molecular replacement. ] 

Also in the 1920s, Victor Moritz Goldschmidt and later Linus Pauling developed rules for 
eliminating chemically unlikely structures and for determining the relative sizes of atoms. 
These rules led to the structure of brookite (1928) and an understanding of the relative 
stability of the rutile, brookite and anatase forms of titanium oxide. 

The distance between two covalently bonded atoms is a sensitive measure of the bond 
strength and its bond order; thus, X-ray crystallographic studies have led to the discovery 
of even more exotic types of bonding in inorganic chemistry, such as metal-metal double 
bonds, 7] metal-metal quadruple bonds, ] and three-center, two-electron bonds. ] X-ray 
crystallography — or, strictly speaking, an inelastic Compton scattering experiment — has 
also provided evidence for the partially covalent character of hydrogen bonds. ] In the 
field of organometallic chemistry, the X-ray structure of ferrocene initiated scientific 
studies of sandwich compounds, ] while that of Zeise's salt stimulated research into "back 
bonding" and metal-pi complexes in general. J Finally, X-ray crystallography had a 
pioneering role in the development of supramolecular chemistry, particularly in clarifying 
the structures of the crown ethers and the principles of host-guest chemistry. 

In material sciences, many complicated inorganic and organometallic systems have been 
analyzed using single-crystal methods, such as fullerenes, metalloporphyrins, and other 
complicated compounds. Single-crystal diffraction is also used in the pharmaceutical 
industry, due to recent problems with polymorphs. The major factors affecting the quality of 
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single-crystal structures are the crystal's size and regularity; recrystallization is a 
commonly used technique to improve these factors in small-molecule crystals. The 
Cambridge Structural Database contains over 400,000 structures; over 99% of these 
structures were determined by X-ray diffraction. 



Mineralogy and metallurgy 

Since the 1920s, X-ray diffraction has been the principal method for determining the 
arrangement of atoms in minerals and metals. The application of X-ray crystallography to 
mineralogy began with the structure of garnet, which was determined in 1924 by Menzer. A 
systematic X-ray crystallographic study of the silicates was undertaken in the 1920s. This 
study showed that, as the Si/O ratio is altered, the silicate crystals exhibit significant 
changes in their atomic arrangements. Machatschki extended these insights to minerals in 
which aluminium substitutes for the silicon atoms of the silicates. The first application of 
X-ray crystallography to metallurgy likewise occurred in the mid-1 920s. J Most notably, 



Linus Pauling's structure of the alloy Mg Sn 
structure of complex ionic crystals. J 



[44] 



led to his theory of the stability and 



Early organic and small biological molecules 

The first structure of an organic compound, 
hexamethylenetetramine, was solved in 1923. J This 
was followed by several studies of long-chain fatty 
acids, which are an important component of biological 
membranes. 7] In the 1930s, the structures of much 
larger molecules with two-dimensional complexity 
began to be solved. A significant advance was the 
structure of phthalocyanine, J a large planar molecule 
that is closely related to porphyrin molecules important 
in biology, such as heme, corrin and chlorophyll. 




X-ray crystallography of biological molecules took off 
with Dorothy Crowfoot Hodgkin, who solved the 
structures of cholesterol (1937), vitamin B12 (1945) 
and penicillin (1954), for which she was awarded the 
Nobel Prize in Chemistry in 1964. In 1969, she 
succeeded in solving the structure of insulin, on which she worked for over thirty years. J 



The three-dimensional structure of 
penicillin, for which Dorothy Crowfoot 
Hodgkin was awarded the Nobel Prize 
in Chemistry in 1964. The green, 
white, red, yellow and blue spheres 
represent atoms of carbon, hydrogen, 
oxygen, sulfur and nitrogen, 
respectively. 
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Ribbon diagram of the structure of 

myoglobin, showing colored alpha 

helices. Such proteins are long, linear 

molecules with thousands of atoms; yet 

the relative position of each atom has 

been determined with sub-atomic 

resolution by X-ray crystallography. 

Since it is difficult to visualize all the 

atoms at once, the ribbon shows the 

rough path of the protein polymer from 

its N-terminus (blue) to its C-terminus 

(red). 



Protein crystallography 

Crystal structures of proteins (which are irregular and 
hundreds of times larger than cholesterol) began to be 
solved in the late 1950s, beginning with the structure of 
sperm whale myoglobin by Max Perutz and Sir John 
Cowdery Kendrew, for which they were awarded the 
Nobel Prize in Chemistry in 1962. [50] Since that 
success, over 48970 X-ray crystal structures of 
proteins, nucleic acids and other biological molecules 
have been determined. ] For comparison, the nearest 
competing method, nuclear magnetic resonance (NMR) 
spectroscopy has produced 7806 structures. ] 
Moreover, crystallography can solve structures of 
arbitrarily large molecules, whereas solution-state NMR 
is restricted to relatively small molecules (less than 70 
kDa). X-ray crystallography is now used routinely by 
scientists to determine how a pharmaceutical interacts 
with its protein target and what changes might be 
advisable to improve it. J However, intrinsic 
membrane proteins remain challenging to crystallize 
because they require detergents or other means to 
solubilize them in isolation, and such detergents often 



interfere with crystallization. Such membrane proteins 
are a large component of the genome and include many proteins of great physiological 
importance, such as ion channels and receptors. J L J 



Relationship to other scattering techniques 



Elastic vs. inelastic scattering 

X-ray crystallography is a form of elastic scattering; the outgoing X-rays have the same 
energy as the incoming X-rays, only with altered direction. Since the energy of a photon is 
inversely proportional to its wavelength, elastic scattering means that the outgoing photons 
have the same wavelength as the incoming photons. By contrast, inelastic scattering occurs 
when energy is transferred from the incoming X-ray to the crystal, e.g., by exciting an 
inner-shell electron to a higher energy level. Such inelastic scattering changes the 
wavelength of the outgoing beam, making it longer and less energetic. Inelastic scattering 
is useful for probing such excitations of matter, but are not as useful in determining the 
distribution of scatterers within the matter, which is the goal of X-ray crystallography. 

X-rays range in wavelength from 10 to 0.01 nanometers; a typical wavelength used for 
crystallography is roughly 1 A (0.1 nm), which is on the scale of covalent chemical bonds 
and the radius of a single atom. Longer-wavelength photons (such as ultraviolet radiation) 
would not have sufficient resolution to determine the atomic positions. At the other 
extreme, shorter-wavelength photons such as gamma rays are difficult to produce in large 
numbers, difficult to focus, and interact too strongly with matter, producing 
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particle-antiparticle pairs. Therefore, X-rays are the "sweetspot" for wavelength when 
determining atomic-resolution structures from the scattering of electromagnetic radiation. 

Other types of X-ray scattering 

X-ray diffraction involves the scattering of X-rays from a single crystal. Other forms of 
elastic X-ray scattering include powder diffraction, SAXS and several types of X-ray fiber 
diffraction, which was used by Rosalind Franklin in determining the double-helix structure 
of DNA. In general, X-ray diffraction produces isolated spots ("reflections"), while the other 
methods produce smooth, continuous scattering. In general, X-ray diffraction offers more 
structural information than these other techniques; however, it requires a sufficiently large 
and regular crystal, which is not always possible to obtain. 

All of these scattering methods generally use monochromatic X-rays, which are restricted to 
a single wavelength with minor deviations. A broad spectrum of X-rays (that is, a blend of 
X-rays with different wavelengths) can also be used to carry out X-ray diffraction, a 
technique known as the Laue method. This is the method used in the original discovery of 
X-ray diffraction. Laue scattering provides much structural information with only a short 
exposure to the X-ray beam, and is therefore used in structural studies of very rapid events 
(time-resolved X-ray crystallography). However, it is not as well-suited as monochromatic 
scattering for determining the full atomic structure of a crystal. It is better suited to 
crystals with relatively simple atomic arrangements, such as minerals. 

The Laue back reflection mode records X-rays scattered backwards also from a broad 
spectrum source. This is useful if the sample is too thick or bulky for X-rays to transmit 
through it. The diffracting planes in the crystal are determined by knowing that the normal 
to the diffracting plane bisects the angle between the incident beam and the diffracted 
beam. A Greninger chart can be used [5 ] to interpret the back reflection Laue photograph. 
The X-calibre RTXDB and MWL 110 are commercial systems for Laue back reflection 
pattern recording. This technique can be used in materials analysis or nondestructive 
testing. 

Electron and neutron diffraction 

Other particles, such as electrons and neutrons, may be used to produce a diffraction 
pattern. Although electron, neutron, and X-ray scattering use very different equipment, the 
resulting diffraction patterns are analyzed using the same coherent diffraction imaging 
techniques. 

As derived below, the electron density within the crystal and the diffraction patterns are 
related by a simple mathematical method, the Fourier transform, which allows the density 
to be calculated relatively easily from the patterns. However, this works only if the 
scattering is weak, i.e., if the scattered beams are much less intense than the incoming 
beam. Weakly scattered beams pass through the remainder of the crystal without 
undergoing a second scattering event. Such re-scattered waves are called "secondary 
scattering" and hinder the calculation of the density of scatterers. Any sufficiently thick 
crystal will produce secondary scattering but since X-rays interact relatively weakly with 
the electrons, this is generally not a significant concern. By contrast, electron beams may 
produce strong secondary scattering even for very small crystals (e.g., 100 |im) used in 
X-ray crystallography. In such cases, extremely thin samples, roughly 100 nanometers or 
less, must be used to avoid secondary scattering; the primary scattered electron beams 
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leave the sample before they have a chance to undergo secondary scattering. Since this 
thickness corresponds roughly to the diameter of many viruses, a promising direction is the 
electron diffraction of isolated macromolecular assemblies, such as viral capsids and 
molecular machines, which may be carried out with a cryo-electron microscope. 

Neutron diffraction is an excellent method for structure determination, although it has been 
difficult to obtain intense, monochromatic beams of neutrons in sufficient quantities. 
Traditionally, nuclear reactors have been used, although the new Spallation Neutron 
Source holds much promise in the near future. Being uncharged, neutrons scatter much 
more readily from the atomic nuclei rather than from the electrons. Therefore, neutron 
scattering is very useful for observing the positions of light atoms with few electrons, 
especially hydrogen, which is essentially invisible in the X-ray diffraction of larger 
molecules. Neutron scattering also has the remarkable property that the solvent can be 
made invisible by adjusting the ratio of normal water, HO, and heavy water, DO. 

Methods 



Overview of single-crystal X-ray diffraction 

The oldest and most precise method of X-ray crystallography 
is single-crystal X-ray diffraction, in which a beam of X-rays 
strikes a single crystal, producing scattered beams. When 
they land on a piece of film or other detector, these beams 
make a diffraction pattern of spots; the strengths and angles 
of these beams are recorded as the crystal is gradually 
rotated. 7] Each spot is called a reflection, since it 
corresponds to the reflection of the X-rays from one set of 
evenly spaced planes within the crystal. For single crystals of 
sufficient purity and regularity, X-ray diffraction data can 
determine the mean chemical bond lengths and angles to 
within a few thousandths of an Angstrom and to within a few 
tenths of a degree, respectively. The atoms in a crystal are 
also not static, but oscillate about their mean positions, 
usually by less than a few tenths of an Angstrom. X-ray 
crystallography allows the size of these oscillations to be 
measured quantitatively. 
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Workflow for solving the 

structure of a molecule by X-ray 

crystallography. 



Procedure 

The technique of single-crystal X-ray crystallography has three basic steps. The first — and 
often most difficult — step is to obtain an adequate crystal of the material under study. The 
crystal should be sufficiently large (typically larger than 100 micrometres in all 
dimensions), pure in composition and regular in structure, with no significant internal 
imperfections such as cracks or twinning. A small or irregular crystal will give fewer and 
less reliable data, from which it may be impossible to determine the atomic arrangement. 

In the second step, the crystal is placed in an intense beam of X-rays, usually of a single 
wavelength (monochromatic X-rays), producing the regular pattern of reflections. As the 
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crystal is gradually rotated, previous reflections disappear and new ones appear; the 
intensity of every spot is recorded at every orientation of the crystal. Multiple data sets may 
have to be collected, with each set covering slightly more than half a full rotation of the 
crystal and typically containing tens of thousands of reflection intensities. 

In the third step, these data are combined computationally with complementary chemical 
information to produce and refine a model of the arrangement of atoms within the crystal. 
The final, refined model of the atomic arrangement — now called a crystal structure — is 
usually stored in a public database. 

Limitations 

As the crystal's repeating unit, its unit cell, becomes larger and more complex, the 
atomic-level picture provided by X-ray crystallography becomes less well-resolved (more 
"fuzzy") for a given number of observed reflections. Two limiting cases of X-ray 
crystallography— "small-molecule" and "macromolecular" crystallography— are often 
discerned. Small-molecule crystallography typically involves crystals with fewer than 100 
atoms in their asymmetric unit; such crystal structures are usually so well resolved that the 
atoms can be discerned as isolated "blobs" of electron density. By contrast, macromolecular 
crystallography often involves tens of thousands of atoms in the unit cell. Such crystal 
structures are generally less well-resolved (more "smeared out"); the atoms and chemical 
bonds appear as tubes of electron density, rather than as isolated atoms. In general, small 
molecules are also easier to crystallize than macromolecules; however, X-ray 
crystallography has proven possible even for viruses with hundreds of thousands of atoms. 

Crystallization 



Although crystallography can be used to characterize 
the disorder in an impure or irregular crystal, 
crystallography generally requires a pure crystal of 
high regularity to solve for the structure of a 
complicated arrangement of atoms. Pure, regular 
crystals can sometimes be obtained from natural or 
man-made materials, such as samples of metals, 
minerals or other macroscopic materials. The regularity 
of such crystals can sometimes be improved with 
A protein crystal seen under a annealing and other methods. However, in many cases, 

microscope. Crystals used in X-ray obtaining a diffraction-quality crystal is the chief 

crystallography are generally smaller rp-oi 

than a millimeter across. barrier to solving its atomic-resolution structure. 1 




Small-molecule and macromolecular crystallography 
differ in the range of possible techniques used to produce diffraction-quality crystals. Small 
molecules generally have few degrees of conformational freedom, and may be crystallized 
by a wide range of methods, such as chemical vapor deposition and recrystallisation. By 
contrast, macromolecules generally have many degrees of freedom and their crystallization 
must be carried out to maintain a stable structure. For example, proteins and larger RNA 
molecules cannot be crystallized if their tertiary structure has been unfolded; therefore, the 
range of crystallization conditions is restricted to solution conditions in which such 
molecules remain folded. 
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Protein crystals are almost always grown in solution. The most common approach is to 
lower the solubility of its component molecules very gradually; however, if this is done too 
quickly, the molecules will precipitate from solution, forming a useless dust or amorphous 
gel on the bottom of the container. Crystal growth in solution is characterized by two steps: 
nucleation of a microscopic crystallite (possibly having only 100 molecules), followed by 
growth of that crystallite, ideally to a diffraction-quality crystal. J The solution conditions 
that favor the first step (nucleation) are not always the same conditions that favor the 
second step (its subsequent growth). The crystallographer's goal is to identify solution 
conditions that favor the development of a single, large crystal, since larger crystals offer 
improved resolution of the molecule. Consequently, the solution conditions should disfavor 
the first step (nucleation) but favor the second (growth), so that only one large crystal 
forms per droplet. If nucleation is favored too much, a shower of small crystallites will form 
in the droplet, rather than one large crystal; if favored too little, no crystal will form 
whatsoever. 

It is extremely difficult to predict good conditions for nucleation or growth of well-ordered 
crystals. 60] In practice, favorable conditions are identified by screening; a very large batch 
of the molecules is prepared, and a wide variety of crystallization solutions are tested. J 
Hundreds, even thousands, of solution conditions are generally tried before finding one that 
succeeds in crystallizing the molecules. The various conditions can use one or more 
physical mechanisms to lower the solubility of the molecule; for example, some may change 
the pH, some contain salts of the Hofmeister series or chemicals that lower the dielectric 
constant of the solution, and still others contain large polymers such as polyethylene glycol 
that drive the molecule out of solution by entropic effects. It is also common to try several 
temperatures for encouraging crystallization, or to gradually lower the temperature so that 
the solution becomes supersaturated. These methods require large amounts of the target 
molecule, as they use high concentration of the molecule(s) to be crystallized. Due to the 
difficulty in obtaining such large quantities (milligrams) of crystallisation grade protein, 
dispensing robots have been developed that are capable of accurately dispensing 
crystallisation trial drops that are of the order on 100 nanoliters in volume. This means that 
roughly 10-fold less protein is used per-experiment when compared to crystallisation trials 
setup by hand (on the order on 1 microliter). J 

Several factors are known to inhibit or mar crystallization. The growing crystals are 
generally held at a constant temperature and protected from shocks or vibrations that 
might disturb their crystallization. Impurities in the molecules or in the crystallization 
solutions are often inimical to crystallization. Conformational flexibility in the molecule also 
tends to make crystallization less likely, due to entropy. Ironically, molecules that tend to 
self-assemble into regular helices are often unwilling to assemble into crystals. Crystals can 
be marred by twinning, which can occur when a unit cell can pack equally favorably in 
multiple orientations; although recent advances in computational methods have begun to 
allow the structures of twinned crystals to be solved, it is still very difficult. Having failed to 
crystallize a target molecule, a crystallographer may try again with a slightly modified 
version of the molecule; even small changes in molecular properties can lead to large 
differences in crystallization behavior. 
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Data collection 

Mounting the crystal 

Once they are full-grown, the crystals are mounted so that they may be held in the X-ray 
beam and rotated. There are several methods of mounting. Although crystals were once 
loaded into glass capillaries with the crystallization solution (the mother liquor), a more 
modern approach is to scoop the crystal up in a tiny loop, made of nylon or plastic and 
attached to a solid rod, that is then flash-frozen with liquid nitrogen. J This freezing 
reduces the radiation damage of the X-rays, as well as the noise in the Bragg peaks due to 
thermal motion (the Debye-Waller effect). However, untreated crystals often crack if 
flash-frozen; therefore, they are generally pre-soaked in a cryoprotectant solution before 
freezing. J Unfortunately, this pre-soak may itself cause the crystal to crack, ruining it for 
crystallography. Generally, successful cryo-conditions are identified by trial and error. 

The capillary or loop is mounted on a goniometer, which allows it to be positioned 
accurately within the X-ray beam and rotated. Since both the crystal and the beam are 
often very small, the crystal must be centered within the beam to within roughly 25 
micrometres accuracy, which is aided by a camera focused on the crystal. The most 
common type of goniometer is the "kappa goniometer", which offers three angles of 
rotation: the gj angle, which rotates about an axis roughly perpendicular to the beam; the k 
angle, about an axis at roughly 50° to the cj axis; and, finally, the cp angle about the 
loop/capillary axis. When the k angle is zero, the cj and cp axes are aligned. The k rotation 
allows for convenient mounting of the crystal, since the arm in which the crystal is mounted 
may be swung out towards the crystallographer. The oscillations carried out during data 
collection (mentioned below) involve the cj axis only. An older type of goniometer is the 
four-circle goniometer, and its relatives such as the six-circle goniometer. 

X-ray sources 

The mounted crystal is then irradiated with a beam of monochromatic X-rays. The brightest 
and most useful X-ray sources are synchrotrons; their much higher luminosity allows for 
better resolution. They also make it convenient to tune the wavelength of the radiation, 
which is useful for multi-wavelength anomalous dispersion (MAD) phasing, described 
below. Synchrotrons are generally national facilities, each with several dedicated beamlines 
where data is collected around the clock, seven days a week. 

Smaller, weaker X-ray sources are often used in 
laboratories to check the quality of crystals before 
bringing them to a synchrotron and sometimes to solve 
a crystal structure. In such systems, electrons are 
boiled off of a cathode and accelerated through a 
strong electric potential of roughly 50 kV; having 
reached a high speed, the electrons collide with a metal 
plate, emitting bremsstrahlung and some strong 
spectral lines corresponding to the excitation of 
inner-shell electrons of the metal. The most common 
metal used is copper, which can be kept cool easily, due 




A diffractometer 



to its high thermal conductivity, and which produces strong K and K lines. The K line is 
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sometimes suppressed with a thin layer (0.0005 in. thick) of nickel foil. The simplest and 
cheapest variety of sealed X-ray tube has a stationary anode (the Crookes tube) and 
produces circa 2 kW of X-ray radiation. The more expensive variety has a rotating-anode 
type source that produces circa 14 kW of X-ray radiation. 

X-rays are generally filtered (by use of X-Ray Filters) to a single wavelength (made 
monochromatic) and collimated to a single direction before they are allowed to strike the 
crystal. The filtering not only simplifies the data analysis, but also removes radiation that 
degrades the crystal without contributing useful information. Collimation is done either 
with a collimator (basically, a long tube) or with a clever arrangement of gently curved 
mirrors. Mirror systems are preferred for small crystals (under 0.3 mm) or with large unit 
cells (over 150 A). 




An X-ray diffraction pattern of a 

crystallized enzyme. The pattern of 

spots (called reflections) can be used 

to determine the structure of the 

enzyme. 



Recording the reflections 

When a crystal is mounted and exposed to an intense 

beam of X-rays, it scatters the X-rays into a pattern of 

spots or reflections that can be observed on a screen 

behind the crystal. A similar pattern may be seen by 

shining a laser pointer at a compact disc. The relative 

intensities of these spots provide the information to 

determine the arrangement of molecules within the 

crystal in atomic detail. The intensities of these 

reflections may be recorded with photographic film, an 

area detector or with a charge-coupled device (CCD) 

image sensor. The peaks at small angles correspond to 

low-resolution data, whereas those at high angles 

represent high-resolution data; thus, an upper limit on 

the eventual resolution of the structure can be 

determined from the first few images. Some measures 

of diffraction quality can be determined at this point, 

such as the mosaicity of the crystal and its overall 

disorder, as observed in the peak widths. Some pathologies of the crystal that would render 

it unfit for solving the structure can also be diagnosed quickly at this point. 

One image of spots is insufficient to reconstruct the whole crystal; it represents only a small 
slice of the full Fourier transform. To collect all the necessary information, the crystal must 
be rotated step-by-step through 180°, with an image recorded at every step; actually, 
slightly more than 180° is required to cover reciprocal space, due to the curvature of the 
Ewald sphere. However, if the crystal has a higher symmetry, a smaller angle such as 90° 
or 45° may be recorded. The axis of the rotation should generally be changed at least once, 
to avoid developing a "blind spot" in reciprocal space close to the rotation axis. It is 
customary to rock the crystal slightly (by 0.5-2°) to catch a broader region of reciprocal 
space. 

Multiple data sets may be necessary for certain phasing methods. For example, MAD 
phasing requires that the scattering be recorded at least three (and usually four, for 
redundancy) wavelengths of the incoming X-ray radiation. A single crystal may degrade too 
much during the collection of one data set, owing to radiation damage; in such cases, data 
sets on multiple crystals must be taken. ] 
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Data analysis 

Crystal symmetry, unit cell, and image scaling 

Having recorded a series of diffraction patterns from the crystal, each corresponding to a 
different crystal orientation, the crystallographer must now convert these two-dimensional 
images into a three-dimensional model of the density of electrons throughout the crystal 
using the mathematical technique of Fourier transforms. (The relevance of this technique is 
explained below.) Roughly speaking, each spot corresponds to a different type of variation 
in the electron density; the crystallographer must determine which variation corresponds to 
which spot (indexing), the relative strengths of the spots in different images (merging and 
scaling) and how the variations should be combined to yield the total electron density 
(phasing). 

In order to process the data, a crystallographer must first index the reflections within the 
multiple images recorded. This means identifying the dimensions of the unit cell and which 
image peak corresponds to which position in reciprocal space. A byproduct of indexing is to 
determine the symmetry of the crystal, i.e., its space group. Some space groups can be 
eliminated from the beginning, since they require symmetries known to be absent in the 
molecule itself. For example, reflection symmetries cannot be observed in chiral molecules; 
thus, only 65 space groups of 243 possible are allowed for protein molecules which are 
almost always chiral. Indexing is generally accomplished using an autoindexing routine. ] 
Having assigned symmetry, the data is then integrated. This converts the hundreds of 
images containing the thousands of reflections into a single file, consisting of (at the very 
least) records of the Miller index of each reflection, and an intensity for each reflection (at 
this state the file often also includes error estimates and measures of partiality (what part 
of a given reflection was recorded on that image)). 

A full data set may consist of hundreds of separate images taken at different orientations of 
the crystal. The first step is to merge and scale these various images, that is, to identify 
which peaks appear in two or more images (merging) and to scale the relative images so 
that they have a consistent intensity scale. Optimizing the intensity scale is critical because 
the relative intensity of the peaks is the key information from which the structure is 
determined. The repetitive technique of crystallographic data collection and the often high 
symmetry of crystalline materials cause the diffractometer to record many 
symmetry-equivalent reflections multiple times. This allows a merging or symmetry related 
R-factor to be calculated based upon how similar are the measured intensities of symmetry 
equivalent reflections, thus giving a score to assess the quality of the data. 

Initial phasing 

The data collected from a diffraction experiment is a reciprocal space representation of the 
crystal lattice. The position of each diffraction 'spot' is governed by the size and shape of 
the unit cell, and the inherent symmetry within the crystal. The intensity of each diffraction 
'spot' is recorded, and this intensity is proportional to the square of the structure factor 
amplitude. The structure factor is a complex number containing information relating to 
both the amplitude and phase of a wave. In order to obtain an interpretable electron 
density map, both amplitude and phase must be known (an electron density map allows a 
crystallographer to build a starting model of the molecule). The phase cannot be directly 
recorded during a diffraction experiment: this is known as the phase problem. Initial phase 
estimates can be obtained in a variety of ways: 
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• Ab initio phasing, aka direct methods - This is usually the method of choice for small 
molecules (<1000 non-hydrogen atoms), and has been used successfully to solve the 
phase problems for small proteins. If the resolution of the data is better than 1.4 A 
(140 pm), direct methods can be used to obtain phase information, by exploiting known 
phase relationships between certain groups of reflections. 67 - 1 [68] 

• Molecular replacement - if a structure exists of a related structure, it can be used as a 
search model in molecular replacement to determine the orientation and position of the 
molecules within the unit cell. The phases obtained this way can be used to generate 
electron density maps} J 

• Anomalous X-ray scattering (MAD or SAD phasing) - the X-ray wavelength may be 
scanned past an absorption edge of an atom, which changes the scattering in a known 
way. By recording full sets of reflections at three different wavelengths (far below, far 
above and in the middle of the absorption edge) one can solve for the substructure of the 
anomalously diffracting atoms and thence the structure of the whole molecule. The most 
popular method of incorporating anomalous scattering atoms into proteins is to express 
the protein in a methionine auxotroph (a host incapable of synthesising methionine) in a 
media rich in Seleno-methionine, which contains Selenium atoms. A MAD experiment can 
then be conducted around the absorption edge, which should then yield the position of 
any methionine residues within the protein, providing initial phases. 70] 

• Heavy atom methods (ie MIR) - If electron-dense metal atoms can be introduced into 
the crystal, direct methods or Patterson-space methods can be used to determine their 
location and to obtain initial phases. Typically, a crystallographer can introduce such 
heavy atoms either by soaking the crystal in a heavy atom-containing solution, or by 
co-crystallization (growing the crystals in the presence of a heavy atom). As in MAD 
phasing, the changes in the scattering amplitudes can be interpreted to yield the phases. 
Although this is the original method by which protein crystal structures were solved, it 
has largely been superseded by MAD phasing with selenomethionine. J 

While all four of the above methods are used to solve the phase problem for protein 
crystallography, small molecule crystallography generally yields data suitable for structure 
solution using direct methods (ab initio phasing). 
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Model building and phase refinement 

Having obtained initial phases, an initial model can be 
built. This model can be used to refine the phases, 
leading to an improved model, and so on. Given a model 
of some atomic positions, these positions and their 
respective Debye-Waller factors (accounting for the 
thermal motion of the atom - aka B-factors) can be 
refined to fit the observed diffraction data, ideally 
yielding a better set of phases. A new model can then 
be fit to the new electron density map and a further 
round of refinement is carried out. This continues until 
the correlation between the diffraction data and the 
model is maximized. The agreement is measured by an 
K-factor defined as 




A protein crystal structure at 2.7 A 

resolution. The mesh encloses the 

region in which the electron density 

exceeds a given threshold. The straight 

segments represent chemical bonds 

between the non-hydrogen atoms of an 

arginine (upper left), a tyrosine (lower 

left), a disulfide bond (upper right, in 

yellow), and some peptide groups 
(running left-right in the middle). The 

two curved green tubes represent 
spline fits to the polypeptide backbone. 
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A similar quality criterion is R f , which is calculated from a subset (-10%) of reflections 
that were not included in the structure refinement. Both R factors depend on the resolution 
of the data. As a rule of thumb, R f should be approximately the resolution in Angstroms 
divided by 10; thus, a data-set with 2 A resolution should yield a final R of roughly 0.2. 
Chemical bonding features such as stereochemistry, hydrogen bonding and distribution of 
bond lengths and angles are complementary measures of the model quality. Phase bias is a 
serious problem in such iterative model building. Omit maps are a common technique used 
to check for this. 

It may not be possible to observe every atom of the crystallized molecule - it must be 
remembered that the resulting electron density is an average of all the molecules within the 
crystal. In some cases, there is too much residual disorder in those atoms, and the resulting 
electron density for atoms existing in many conformations is smeared to such an extent that 
it is no longer detectable in the electron density map. Weakly scattering atoms such as 
hydrogen are routinely invisible. It is also possible for a single atom to appear multiple 
times in an electron density map, e.g., if a protein sidechain has multiple (<4) allowed 
conformations. In still other cases, the crystallographer may detect that the covalent 
structure deduced for the molecule was incorrect, or changed. For example, proteins may 
be cleaved or undergo post-translational modifications that were not detected prior to the 
crystallization. 
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Deposition of the structure 

Once the model of a molecule's structure has been finalized, it is often deposited in a 
crystallographic database such as the Protein Data Bank (for protein structures) or the 
Cambridge Structural Database (for small molecules). Many structures obtained in private 
commercial ventures to crystallize medicinally relevant proteins, are not deposited in public 
crystallographic databases. 

Diffraction theory 

The main goal of X-ray crystallography is to determine the density of electrons f(r) 
throughout the crystal, where r represents the three-dimensional position vector within the 
crystal. To do this, X-ray scattering is used to collect data about its Fourier transform F(q), 
which is inverted mathematically to obtain the density defined in real space, using the 
formula 
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where the integral is summed over all possible values of q. The three-dimensional real 
vector q represents a point in reciprocal space, that is, to a particular oscillation in the 
electron density as one moves in the direction in which q points. The length of q 
corresponds to 2n divided by the wavelength of the oscillation. The corresponding formula 
for a Fourier transform will be used below 
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where the integral is summed over all possible values of the position vector r within the 
crystal. 

The Fourier transform F(q) is generally a complex number, and therefore has a magnitude 
\F(q)\ and a phase cp(q) related by the equation 

F(q} = \F( q }\e i ^ 
The intensities of the reflections observed in X-ray diffraction give us the magnitudes \F(q)\ 
but not the phases cp(q). To obtain the phases, full sets of reflections are collected with 
known alterations to the scattering, either by modulating the wavelength past a certain 
absorption edge or by adding strongly scattering (i.e., electron-dense) metal atoms such as 
mercury. Combining the magnitudes and phases yields the full Fourier transform F(q), 
which may be inverted to obtain the electron density f( r). 

Crystals are often idealized as being perfectly periodic. In that ideal case, the atoms are 
positioned on a perfect lattice, the electron density is perfectly periodic, and the Fourier 
transform F(q) is zero except when q belongs to the reciprocal lattice (the so-called Bragg 
peaks). In reality, however, crystals are not perfectly periodic; atoms vibrate about their 
mean position, and there may be disorder of various types, such as mosaicity, dislocations, 
various point defects, and heterogeneity in the conformation of crystallized molecules. 
Therefore, the Bragg peaks have a finite width and there may be significant diffuse 
scattering, a continuum of scattered X-rays that fall between the Bragg peaks. 
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Intuitive understanding by Bragg 's law 

An intuitive understanding of X-ray diffraction can be obtained from the Bragg model of 
diffraction. In this model, a given reflection is associated with a set of evenly spaced sheets 
running through the crystal, usually passing through the centers of the atoms of the crystal 
lattice. The orientation of a particular set of sheets is identified by its three Miller indices 
(h, k, /), and let their spacing be noted by d. William Lawrence Bragg proposed a model in 
which the incoming X-rays are scattered specularly (mirror-like) from each plane; from that 
assumption, X-rays scattered from adjacent planes will combine constructively (constructive 
interference) when the angle between the plane and the X-ray results in a path-length 
difference that is an integer multiple n of the X-ray wavelength A. 

2d sin = nX 
A reflection is said to be indexed when its Miller indices (or, more correctly, its reciprocal 
lattice vector components) have been identified from the known wavelength and the 
scattering angle 20. Such indexing gives the unit-cell parameters, the lengths and angles of 
the unit-cell, as well as its space group. Since Bragg's law does not interpret the relative 
intensities of the reflections, however, it is generally inadequate to solve for the 
arrangement of atoms within the unit-cell; for that, a Fourier transform method must be 
carried out. 

Scattering as a Fourier transform 

The incoming X-ray beam has a polarization and should be represented as a vector wave; 
however, for simplicity, let it be represented here as a scalar wave. We also ignore the 
complication of the time dependence of the wave and just focus on the wave's spatial 
dependence. Plane waves can be represented by a wave vector k. , and so the strength of 
the incoming wave at time t=0 is given by 

Ae ik ^' T 

At position r within the sample, let there be a density of scatterers f(r); these scatterers 
should produce a scattered spherical wave of amplitude proportional to the local amplitude 
of the incoming wave times the number of scatterers in a small volume dV about r 

amplitude of scattered wave = J 4e* k,r iS'/(r)dV F 
where S is the proportionality constant. 

Let's consider the fraction of scattered waves that leave with an outgoing wave-vector of 

k ^ and strike the screen at r . Since no energy is lost (elastic, not inelastic 

out screen * J 

scattering), the wavelengths are the same as are the magnitudes of the wave-vectors |k. | = 

Ik J. From the time that the photon is scattered at r until it is absorbed at r , the 

1 out 1 c screen 

photon undergoes a change in phase 

The net radiation arriving at r is the sum of all the scattered waves throughout the 

° screen ° 



crystal 
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which may be written as a Fourier transform 
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where q = k - k . The measured intensity of the reflection will be square of this 
amplitude 

^5 2 |F(q)| 2 

Friedel and Bijvoet mates 

For every reflection corresponding to a point q in the reciprocal space, there is another 
reflection of the same intensity at the opposite point -q. This opposite reflection is known 
as the Friedel mate of the original reflection. This symmetry results from the mathematical 
fact that the density of electrons f(r) at a position r is always a real number. As noted 
above, f(r) is the inverse transform of its Fourier transform F(q); however, such an inverse 
transform is a complex number in general. To ensure that f(r) is real, the Fourier transform 
F(q) must be such that the Friedel mates F(-q) and F(q) are complex conjugates of one 
another. Thus, F(-q) has the same magnitude as F(q)— that is, |F|(q) = |F|(-q)— but they 
have the opposite phase, i.e., cp(q) = -cp(q) 

F(-q) = |F(-q)| e W-<> = F* (q) = |F(q)| e~^ 
The equality of their magnitudes ensures that the Friedel mates have the same intensity 
\F\ . This symmetry allows one to measure the full Fourier transform from only half the 
reciprocal space, e.g., by rotating the crystal slightly more than a 180°, instead of a full 
turn. In crystals with significant symmetry, even more reflections may have the same 
intensity (Bijvoet mates); in such cases, even less of the reciprocal space may need to be 
measured, e.g., slightly more than 90°. 

The Friedel-mate constraint can be derived from the definition of the inverse Fourier 
transform 
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Since Euler's formula states that e lx = cos(x) + z sin(x), the inverse Fourier transform can 

be separated into a sum of a purely real part and a purely imaginary part 

f & = fl^\ F ^\ e ^ +Cl ' r] - /-^3inq)|cos(^ + q-r)+z / -^_ |F(q)| sin (<£ + q ■ r) == I c 
J (2?r) J (2?r) J (2tt) 

The function f(r) is real if and only if the second integral I . is zero for all values of r. In 

turn, this is true if and only if the above constraint is satisfied 

J ™ =: /-^|F(q)|sm(0 + q-r)=: / -^ |F(-q)| sin (-0 - q ■ r) = -L 
since I . = —I . implies that I . =0. 

sin sin ^ sin 

Ewald's sphere 

Each X-ray diffraction image represents only a slice, a spherical slice of reciprocal space, as 
may be seen by the Ewald sphere construction. Both k and k. have the same length, due 
to the elastic scattering, since the wavelength has not changed. Therefore, they may be 
represented as two radial vectors in a sphere in reciprocal space, which shows the values of 
q that are sampled in a given diffraction image. Since there is a slight spread in the 
incoming wavelengths of the incoming X-ray beam, the values of \F(q)\ can be measured 
only for q vectors located between the two spheres corresponding to those radii. Therefore, 
to obtain a full set of Fourier transform data, it is necessary to rotate the crystal through 
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slightly more than 180°, or sometimes less if sufficient symmetry is present. A full 360° 
rotation is not needed because of a symmetry intrinsic to the Fourier transforms of real 
functions (such as the electron density), but "slightly more" than 180° is needed to cover all 
of reciprocal space within a given resolution because of the curvature of the Ewald sphere 
(add Figure to illustrate this). In practice, the crystal is rocked by a small amount (0.25-1°) 
to incorporate reflections near the boundaries of the spherical Ewald shells. 

Patterson function 

A well-known result of Fourier transforms is the autocorrelation theorem, which states that 
the autocorrelation c(r) of a function f(r) 



=(r) = J dx/(x)/(x + r) = J -^-3 C(q)e ! <* r 



has a Fourier transform Cfqjthat is the squared magnitude of F(q) 

C(q) = |F(q)| 2 
Therefore, the autocorrelation function c(r) of the electron density (also known as the 
Patterson function^ J ) can be computed directly from the reflection intensities, without 
computing the phases. In principle, this could be used to determine the crystal structure 
directly; however, it is difficult to realize in practice. The autocorrelation function 
corresponds to the distribution of vectors between atoms in the crystal; thus, a crystal of N 
atoms in its unit cell may have N(N-l) peaks in its Patterson function. Given the inevitable 
errors in measuring the intensities, and the mathematical difficulties of reconstructing 
atomic positions from the interatomic vectors, this technique is rarely used to solve 
structures, except for the simplest crystals. 

Advantages of a crystal 

In principle, an atomic structure could be determined from applying X-ray scattering to 
non-crystalline samples, even to a single molecule. However, crystals offer a much stronger 
signal due to their periodicity. 

A crystalline sample is by definition periodic; a crystal is composed of many unit cells 
repeated indefinitely in three independent directions. Such periodic systems have a Fourier 
transform that is concentrated at periodically repeating points in reciprocal space known as 
Bragg peaks; the Bragg peaks correspond to the reflection spots observed in the diffraction 
image. Since the amplitude at these reflections grows linearly with the number N of 
scatterers, the observed intensity of these spots should grow quadratically, like AT 2 . In other 
words, using a crystal concentrates the weak scattering of the individual unit cells into a 
much more powerful, coherent reflection that can be observed above the noise. This is an 
example of constructive interference. 

In a non-crystalline sample, molecules within that sample would be in random orientations 
and therefore would have a continuous Fourier spectrum that spreads its amplitude more 
uniformly and with a much reduced intensity, as is observed in SAXS. More importantly, the 
orientational information is lost. In the crystal, the molecules adopt the same orientation 
within the crystal, whereas in a liquid, powder or amorphous state, the observed signal is 
averaged over the possible orientations of the molecules. Although theoretically possible 
with sufficiently low-noise data, it is generally difficult to obtain atomic-resolution 
structures of complicated, asymmetric molecules from such rotationally averaged 
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scattering data. An intermediate case is fiber diffraction in which the subunits are arranged 
periodically in at least one dimension. 

See also 

• Bragg diffraction 

• Bravais lattice 

• Crystallographic database 

• Crystallographic point groups 

• Difference density map 

• Electron diffraction 

• Neutron diffraction 

• Powder diffraction 

• Scherrer Equation 

• Small angle X-ray scattering (SAXS) 

• Structure determination 

• Wide angle X-ray scattering (WAXS) 

• Ptychography 
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X-ray crystallography 101 

External links 

Tutorials 

• Simple, non technical introduction (http://stein.bioch.dundee.ac.uk/~charlie/index. 
php?section=l) 

• "Small Molecule Crystalization" (http://acaschool.iit.edu/lectures04/JLiangXtal.pdf) 
(PDF) at Illinois Institute of Technology website 

• International Union of Crystallography (http://iucr.org) 

• Crystallography 101 (http://www.ruppweb.org/Xray/101index.html) 

• Interactive structure factor tutorial (http://www.ysbl.york.ac.uk/~cowtan/sfapplet/ 
sfintro.html), demonstrating properties of the diffraction pattern of a 2D crystal. 

• Picturebook of Fourier Transforms (http://www.ysbl.york.ac.uk/~cowtan/fourier/ 
fourier.html), illustrating the relationship between crystal and diffraction pattern in 2D. 

• Lecture notes on X-ray crystallography and structure determination (http://www.chem. 
uwec.edu/Chem406_F06/Pages/lectnotes. html#lecture7) 

Primary databases 

• Protein Data Bank (http://www.rcsb.org/pdb/home/home.do) (PDB) 

• Nucleic Acid Databank (http://ndbserver.rutgers.edu/) (NDB) 

• Cambridge Structural Database (http://www.ccdc.cam.ac.uk/products/csd/) (CSD) 

• Inorganic Crystal Structure Database (http://www.fiz-karlsruhe.de/icsd.html) (ICSD) 

• Biological Macromolecule Crystallization Database (http://xpdb.nist.gov:8060/BMCD4/ 
) (BMCD) 

Derivative databases 

• PDBsum (http://www.ebi.ac.uk/thornton-srv/databases/pdbsum/) 

• Proteopeida - the collaborative, 3D encyclopedia of proteins and other molecules (http:// 
www.proteopedia.org) 

• RNABase (http://www.rnabase.org/) 

• HIC-Up database of PDB ligands (http://xray.bmc.uu.se/hicup/) 

• Structural Classification of Proteins database 

• CATH Protein Structure Classification 

• List of transmembrane proteins with known 3D structure (http://blanco.biomol.uci. 
edu/Membrane_Proteins_xtal.html) 

• Orientations of Proteins in Membranes database 

Structural validation 

• WHAT-IF structural validation suite (http://swift.cmbi.kun.n1/WIWWWI//) 

• Biotech structural validation suite (http://biotech.ebi.ac.uk/) (formerly ProCheck) 

• MolProbity structural validation suite (http://molprobity.biochem.duke.edu/) 

• ProSA-web (https://prosa.services.came.sbg.ac.at/prosa.php) 

• NQ-Flipper (https://flipper.services.came.sbg.ac.at/) (check for unfavorable rotamers 
of Asn and Gin residues) 

• DALI server (http://www.ebi.ac.uk/dali/) (identifies proteins similar to a given 
protein) 
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DNA molecular modeling 



X-ray diffraction 



1. REDIRECT X-ray scattering techniques 



X-ray microscopy 



1. REDIRECT X-ray microscope 



Electron microscopy 



1. REDIRECT electron microscope 

Vibrational circular dichroism 

Vibrational circular dichroism (VCD) spectroscopy is basically circular dichroism 
spectroscopy in the infrared and near infrared ranges^ J . Because VCD is sensitive to the 
mutual orientation of distinct groups in a molecule, it provides three-dimensional structural 
information. Thus, it is a powerful technique as VCD spectra of enantiomers can be 
simulated using ab initio calculations, thereby allowing the identification of absolute 
configurations of small molecules in solution from VCD spectra. Among such quantum 
computations of VCD spectra resulting from the chiral properties of small organic 
molecules are those based on density functional theory (DFT) and gauge-invariant atomic 
orbitals (GIAO). As a simple example of the experimental results that were obtained by VCD 
are the spectral data obtained within the carbon-hydrogen (C-H) stretching region of 21 
amino acids in heavy water solutions. Measurements of vibrational optical activity (VOA) 
have thus numerous applications, not only for small molecules, but also for large and 
complex biopolymers such as muscle proteins (myosin, for example) and DNA. 

Vibrational modes 
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Agitation moleculaire en milieu aqueux 



VCD of peptides and proteins 

Extensive VCD studies have been reported for both polypeptides and several proteins in 
solution^ ] [ ] [ ] ; several recent reviews were also compiled [5] [ ] [7] [ . An extensive but 
not comprehensive VCD publications list is also provided in the "References" section. The 
published reports over the last 22 years have established VCD as a powerful technique with 
improved results over those previously obtained by visible/UV circular dichroism (CD) or 
optical rotatory dispersion (ORD) for proteins and nucleic acids. 

Amino acid and polypeptide structures 
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VCD of nucleic acids 

VCD spectra of nucleotides, synthetic polynucleotides and several nucleic acids, including 
DNA, have been reported and assigned in terms of the type and number of helices present 
in A- , B-, and Z- DNA. 

VCD Instrumentation 

For biopolymers such as proteins and nucleic acids, the difference in absorbance between 
the levo- and dextro- configurations is five orders of magnitude smaller than the 
corresonding (unpolarized) absorbance. Therefore, VCD of biopolymers requires the use of 
very sensitive, specially built instrumentation as well as time-averaging over relatively long 
intervals of time even with such sensitive VCD spectrometers. Most CD instruments 
produce left- and right- circularly polarized light which is then either sine-wave or 
square-wave modulated, with subsequent phase-sensitive detection and lock-in 
amplification of the detected signal. In the case of FT-VCD, a photo-elastic modulator (PEM) 
is employed in conjunction with an FT-IR interferometer set-up. An example is that of a 
Bomem model MB-100 FT-IR interferometer equipped with additional polarizing optics/ 
accessories needed for recording VCD spectra. A parallel beam emerges through a side 
port of the interferometer which passes first through a wire grid linear polarizer and then 
through an octagonal-shaped ZnSe crystal PEM which modulates the polarized beam at a 
fixed, lower frequency such as 37.5 kHz. A mechanically stressed crystal such as ZnSe 
exhibits birefringence when stressed by an adjacent piezoelectric transducer. The linear 
polarizer is positioned close to, and at 45 degrees, with respect to the ZnSe crystal axis. 
The polarized radiation focused onto the detector is doubly modulated, both by the PEM 
and by the interferometer setup. A very low noise detector, such as MCT (HgCdTe), is also 
selected for the VCD signal phase-sensitive detection. Quasi-complete commercial FT-VCD 
instruments are also available from a few manufacturers but these are quite expensive and 
also have to be still considered as being at the prototype stage. To prevent detector 
saturation an appropriate, long wave pass filter is placed before the very low noise MCT 
detector, which allows only radiation below 1750 cm" to reach the MCT detector; the latter 
however measures radiation only down to 750 cm" 1 . FT-VCD spectra accumulation of the 
selected sample solution is then carried out, digitized and stored by an in-line computer. 
Published reviews that compare various VCD methods are also available. 9] [10] 
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Magnetic VCD 

VCD spectra have also been reported in the presence of an applied external magnetic 
fiekr J . This method can enhance the VCD spectral resolution for small molecules^ J L J 

[14] [15] [16] ^ 

Raman optical activity (ROA) 

ROA is a technique complementary to VCD especially useful in the 50—1600 cm" spectral 
region; it is considered as the technique of choice for determining optical activity for 
photon energies less then 600 cm" . 



Notes 

[1] http://planetphysics.org/?op=getobj;from=objects;id=410 Principles of IR and NIR Spectroscopy 

[2] *"Vibrational Circular Dichroism of Polypeptides XII. Re-evaluation of the Fourier Transform Vibrational 

Circular Dichroism of Poly-gamma-Benzyl-L-Glutamate," P. Malon, R. Kobrinskaya, T. A. Keiderling, 

Biopolymers 27, 733-746 (1988). 
[3] *"Vibrational Circular Dichroism of Biopolymers/ 1 T. A. Keiderling, S. C. Yasui, U. Narayanan, A. Annamalai, P. 

Malon, R. Kobrinskaya, L. Yang, in Spectroscopy of Biological Molecules New Advances ed. E. D. Schmid, F. W. 

Schneider, F. Siebert, p. 73-76 (1988). 
[4] *"Vibrational Circular Dichroism of Polypeptides and Proteins," S. C. Yasui, T. A. Keiderling, Mikrochimica 

Acta, II, 325-327, (1988). 
[5] *"Vibrational Circular Dichroism of Proteins Polysaccharides and Nucleic Acids" T. A. Keiderling, Chapter 8 in 

Physical Chemistry of Food Processes, Vol. 2 Advanced Techniques, Structures and Applications., eds. I.C. 

Baianu, H. Pessen, T. Kumosinski, Van Norstrand-Reinhold, New York (1993), pp 307-337. 
[6] "Spectroscopic characterization of Unfolded peptides and proteins studied with infrared absorption and 

vibrational circular dichroism spectra" T. A. Keiderling and Qi Xu, Advances in Protein Chemistry Volume 62, 

[Unfolded Proteins, Dedicated to John Edsall, Ed.: George Rose, Academic Press: New York] (2002), pp. 111-161. 
[7] *"Protein and Peptide Secondary Structure and Conformational Determination with Vibrational Circular 

Dichroism " Timothy A. Keiderling, Current Opinions in Chemical Biology (Ed. Julie Leary and Mark Arnold) 6, 

682-688 (2002). 
[8] *Review: Conformational Studies of Peptides with Infrared Techniques. Timothy A. Keiderling and R. A. G. D. 

Silva, in Synthesis of Peptides and Peptidomimetics, Ed. M. Goodman and G. Herrman, Houben-Weyl, Vol 22Eb, 

Georg Thiem Verlag, New York (2002) pp. 715-738, (written and accepted in 2000). 
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[9] "Polarization Modulation Fourier Transform Infrared Spectroscopy with Digital SignalProcessing: Comparison 

of Vibrational Circular Dichroism Methods." Jovencio Hilario, DavidDrapcho, Raul Curbelo, Timothy A. 

Keiderling, Applied Spectroscopy 55, 1435-1447(2001)- 
[10] "Vibrational circular dichroism of biopolymers. Summary of methods and applications.", Timothy A. 

Keiderling, Jan Kubelka, Jovencio Hilario, in Vibrational spectroscopy of polymers and biological systems, Ed. 

Mark Braiman, Vasilis Gregoriou, Taylor&Francis, Atlanta (CRC Press, Boca Raton, FL) (2006) pp. 253-324 

(originally written in 2000, updated in 2003) 
[11] "Observation of Magnetic Vibrational Circular Dichroism," T. A. Keiderling, Journal of Chemical Physics, 75, 

3639-41 (1981). 
[12] "Vibrational Spectral Assignment and Enhanced Resolution Using Magnetic Vibrational Circular Dichroism," 

T. R. Devine and T. A. Keiderling, Spectrochimica Acta, 43A, 627-629 (1987). 
[13] "Magnetic Vibrational Circular Dichroism with an FTIR" P. V. Croatto, R. K. Yoo, T. A. Keiderling, SPIE 

Proceedings 1145 (7th International Conference on FTS, ed. D. G. Cameron) 152-153 (1989). 
[14] "Direct Measurement of the Rotational g-Value in the Ground State of Acetylene by Magnetic Vibrational 

Circular Dichroism." C. N. Tarn and T. A. Keiderling, Chemical Physics Letters, 243, 55-58 (1995). 
[15] . "Ab initio calculation of the vibrational magnetic dipole moment" P. Bour, C. N. Tarn, T. A. Keiderling, 

Journal of Physical Chemistry 99, 17810-17813 (1995) 
[16] "Rotationally Resolved Magnetic Vibrational Circular Dichroism. Experimental Spectra and Theoretical 

Simulation for Diamagnetic Molecules." P. Bour, C. N. Tarn, B. Wang, T. A. Keiderling, Molecular Physics 87, 

299-318, (1996). 
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Nuclear magnetic resonance 

spectroscopy, most commonly known 
as NMR spectroscopy, is the name 
given to a technique which exploits the 
magnetic properties of certain nuclei. 
This phenomenon and its origins are 
detailed in a separate section on nuclear 
magnetic resonance. The most 
important applications for the organic 
chemist are proton NMR and carbon- 13 
NMR spectroscopy. In principle, NMR is 
applicable to any nucleus possessing 
spin. 

Many types of information can be 
obtained from an NMR spectrum. Much 
like using infrared spectroscopy to 
identify functional groups, analysis of a 
ID NMR spectrum provides information 
on the number and type of chemical 
entities in a molecule. However, NMR 
provides much more information than 
IR. 




A 900MHz NMR instrument with a 21.2 T magnet at 
HWB-NMR, Birmingham, UK, being loaded with a sample. 



The impact of NMR spectroscopy on the natural sciences has been substantial. It can, 
among other things, be used to study mixtures of analytes, to understand dynamic effects 
such as change in temperature and reaction mechanisms, and is an invaluable tool in 
understanding protein and nucleic acid structure and function. It can be applied to a wide 
variety of samples, both in the solution and the solid state. 
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Basic NMR techniques 

When placed in a magnetic field, NMR active nuclei 
(such as H or C) absorb at a frequency 
characteristic of the isotope. The resonant 
frequency, energy of the absorption and the 
intensity of the signal are proportional to the 
strength of the magnetic field. For example, in a 21 
tesla magnetic field, protons resonate at 900 MHz. 
It is common to refer to a 21 T magnet as a 900 
MHz magnet, although different nuclei resonate at a 
different frequency at this field strength. 

In the Earth's magnetic field the same nuclei 
resonate at audio frequencies. This effect is used in 
Earth's field NMR spectrometers and other 
instruments. Because these instruments are 
portable and inexpensive, they are often used for 
teaching and field work. 

Chemical shift 




The NMR sample is prepared in a 
thin-walled glass tube - an NMR tube. 



Depending on the local chemical environment, different protons in a molecule resonate at 
slightly different frequencies. Since both this frequency shift and the fundamental resonant 
frequency are directly proportional to the strength of the magnetic field, the shift is 
converted into a field -independent dimensionless value known as the chemical shift. The 
chemical shift is reported as a relative measure from some reference resonance frequency. 
(For the nuclei H, C, and Si, TMS (tetramethylsilane) is commonly used as a 
reference.) This difference between the frequency of the signal and the frequency of the 
reference is divided by frequency of the reference signal to give the chemical shift. The 
frequency shifts are extremely small in comparison to the fundamental NMR frequency. A 
typical frequency shift might be 100 Hz, compared to a fundamental NMR frequency of 100 
MHz, so the chemical shift is generally expressed in parts per million (ppm). J 

By understanding different chemical environments, the chemical shift can be used to obtain 
some structural information about the molecule in a sample. The conversion of the raw data 
to this information is called assigning the spectrum. For example, for the H-NMR spectrum 
for ethanol (CH CH OH), one would expect three specific signals at three specific chemical 
shifts: one for the CH group, one for the CH group and one for the OH group. A typical 
CH group has a shift around 1 ppm, a CH attached to an OH has a shift of around 4 ppm 
and an OH has a shift around 2-3 ppm depending on the solvent used. 

Because of molecular motion at room temperature, the three methyl protons average out 
during the course of the NMR experiment (which typically requires a few ms). These 
protons become degenerate and form a peak at the same chemical shift. 

The shape and size of peaks are indicators of chemical structure too. In the example 
above— the proton spectrum of ethanol— the CH peak would be three times as large as the 
OH. Similarly the CH peak would be twice the size of the OH peak but only 2/3 the size of 
the CH peak. 
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Modern analysis software allows analysis of the size of peaks to understand how many 
protons give rise to the peak. This is known as integration— a mathematical process which 
calculates the area under a graph (essentially what a spectrum is). The analyst must 
integrate the peak and not measure its height because the peaks also have width— and thus 
its size is dependent on its area not its height. However, it should be mentioned that the 
number of protons, or any other observed nucleus, is only proportional to the intensity, or 
the integral, of the NMR signal, in the very simplest one-dimensional NMR experiments. In 
more elaborate experiments, for instance, experiments typically used to obtain carbon- 13 
NMR spectra, the integral of the signals depends on the relaxation rate of the nucleus, and 
its scalar and dipolar coupling constants. Very often these factors are poorly understood - 
therefore, the integral of the NMR signal is very difficult to interpret in more complicated 
NMR experiments. 

J-coupling 



Multiplicity 


Intensity Ratio 


Singlet (s) 


1 


Doublet (d) 


1:1 


Triplet (t) 


1:2:1 


Quartet (q) 


1:3:3:1 


Quintet 


1:4:6:4:1 


Sextet 


1:5:10:10:5:1 


Septet 


1:6:15:20:15:6:1 



Some of the most useful information for structure determination in a one-dimensional NMR 
spectrum comes from J-coupling or scalar coupling (a special case of spin-spin coupling) 
between NMR active nuclei. This coupling arises from the interaction of different spin 
states through the chemical bonds of a molecule and results in the splitting of NMR signals. 
These splitting patterns can be complex or simple and, likewise, can be straightforwardly 
interpretable or deceptive. This coupling provides detailed insight into the connectivity of 
atoms in a molecule. 

Coupling to n equivalent (spin Vi) nuclei splits the signal into a n + 1 multiplet with 
intensity ratios following Pascal's triangle as described on the right. Coupling to additional 
spins will lead to further splittings of each component of the multiplet e.g. coupling to two 
different spin Vi nuclei with significantly different coupling constants will lead to a doublet 
of doublets (abbreviation: dd). Note that coupling between nuclei that are chemically 
equivalent (that is, have the same chemical shift) has no effect of the NMR spectra and 
couplings between nuclei that are distant (usually more than 3 bonds apart for protons in 
flexible molecules) are usually too small to cause observable splittings. Long-range 
couplings over more than three bonds can often be observed in cyclic and aromatic 
compounds, leading to more complex splitting patterns. 

For example, in the proton spectrum for ethanol described above, the CH group is split 
into a triplet with an intensity ratio of 1:2:1 by the two neighboring CH protons. Similarly, 
the CH is split into a quartet with an intensity ratio of 1:3:3:1 by the three neighboring 
CH protons. In principle, the two CH 2 protons would also be split again into a doublet to 
form a doublet of quartets by the hydroxyl proton, but intermolecular exchange of the 
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acidic hydroxyl proton often results in a loss of coupling information. 

Coupling to any spin Vi nuclei such as phosphorus-31 or fluorine- 19 works in this fashion 
(although the magnitudes of the coupling constants may be very different). But the splitting 
patterns differ from those described above for nuclei with spin greater than Vi because the 
spin quantum number has more than two possible values. For instance, coupling to 
deuterium (a spin 1 nucleus) splits the signal into a 1:1:1 triplet because the spin 1 has 
three spin states. Similarly, a spin 3/2 nucleus splits a signal into a 1:1:1:1 quartet and so 
on. 

Coupling combined with the chemical shift (and the integration for protons) tells us not only 
about the chemical environment of the nuclei, but also the number of neighboring NMR 
active nuclei within the molecule. In more complex spectra with multiple peaks at similar 
chemical shifts or in spectra of nuclei other than hydrogen, coupling is often the only way 
to distinguish different nuclei. 

Second-order (or strong) coupling 

The above description assumes that the coupling constant is small in comparison with the 
difference in NMR frequencies between the inequivalent spins. If the shift separation 
decreases (or the coupling strength increases), the multiplet intensity patterns are first 
distorted, and then become more complex and less easily analyzed (especially if more than 
two spins are involved). Intensification of some peaks in a multiplet is achieved at the 
expense of the remainder, which sometimes almost disappear in the background noise, 
although the integrated area under the peaks remains constant. In most high-field NMR, 
however, the distortions are usually modest and the characteristic distortions (roofing) can 
in fact help to identify related peaks. 

Second-order effects decrease as the frequency difference between multiplets increases, so 
that high-field (i.e. high-frequency) NMR spectra display less distortion than lower 
frequency spectra. Early spectra at 60 MHz were more prone to distortion than spectra 
from later machines typically operating at frequencies at 200 MHz or above. 

Magnetic inequivalence 

More subtle effects can occur if chemically equivalent spins (i.e. nuclei related by symmetry 
and so having the same NMR frequency) have different coupling relationships to external 
spins. Spins that are chemically equivalent but are not indistinguishable (based on their 
coupling relationships) are termed magnetically inequivalent. For example, the 4 H sites of 
1,2-dichlorobenzene divide into two chemically equivalent pairs by symmetry, but an 
individual member of one of the pairs has different couplings to the spins making up the 
other pair. Magnetic inequivalence can lead to highly complex spectra which can only be 
analyzed by computational modeling. Such effects are more common in NMR spectra of 
aromatic and other non-flexible systems, while conformational averaging about C-C bonds 
in flexible molecules tends to equalize the couplings between protons on adjacent carbons, 
reducing problems with magnetic inequivalence. 
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Correlation spectroscopy 

Correlation spectroscopy is one of several types of two-dimensional nuclear magnetic 
resonance (NMR) spectroscopy. This type of NMR experiment is best known by its 
acronym, COSY. Other types of two-dimensional NMR include J-spectroscopy, exchange 
spectroscopy (EXSY), Nuclear Overhauser effect spectroscopy (NOESY), total correlation 
spectroscopy (TOCSY) and heteronuclear correlation experiments, such as HSQC, HMQC, 
and HMBC. Two-dimensional NMR spectra provide more information about a molecule than 
one-dimensional NMR spectra and are especially useful in determining the structure of a 
molecule, particularly for molecules that are too complicated to work with using 
one-dimensional NMR. The first two-dimensional experiment, COSY, was proposed by Jean 
Jeener, a professor at Universite Libre de Bruxelles, in 1971. This experiment was later 
implemented by Walter P. Aue, Enrico Bartholdi and Richard R. Ernst, who published their 
work in 1976. [2] 

Solid-state nuclear magnetic resonance 

A variety of physical circumstances does not allow molecules to be studied in solution, and 
at the same time not by other spectroscopic techniques to an atomic level, either. In 
solid-phase media, such as crystals, microcrystalline powders, gels, anisotropic solutions, 
etc., it is in particular the dipolar coupling and chemical shift anisotropy that become 
dominant to the behaviour of the nuclear spin systems. In conventional solution-state NMR 
spectroscopy, these additional interactions would lead to a significant broadening of 
spectral lines. A variety of techniques allows to establish high-resolution conditions, that 
can, at least for C spectra, be comparable to solution-state NMR spectra. 

Two important concepts for high-resolution solid-state NMR spectroscopy are the limitation 
of possible molecular orientation by sample orientation, and the reduction of anisotropic 
nuclear magnetic interactions by sample spinning. Of the latter approach, fast spinning 
around the magic angle is a very prominent method, when the system comprises spin 1/2 
nuclei. A number of intermediate techniques, with samples of partial alignment or reduced 
mobility, is currently being used in NMR spectroscopy. 

Applications in which solid-state NMR effects occur are often related to structure 
investigations on membrane proteins, protein fibrils or all kinds of polymers, and chemical 
analysis in inorganic chemistry, but also include "exotic" applications like the plant leaves 
and fuel cells. 

NMR spectroscopy applied to proteins 

Much of the recent innovation within NMR spectroscopy has been within the field of protein 
NMR, which has become a very important technique in structural biology. One common 
goal of these investigations is to obtain high resolution 3-dimensional structures of the 
protein, similar to what can be achieved by X-ray crystallography. In contrast to X-ray 
crystallography, NMR is primarily limited to relatively small proteins, usually smaller than 
35 kDa, though technical advances allow ever larger structures to be solved. NMR 
spectroscopy is often the only way to obtain high resolution information on partially or 
wholly intrinsically unstructured proteins. 

Proteins are orders of magnitude larger than the small organic molecules discussed earlier 
in this article, but the same NMR theory applies. Because of the increased number of each 
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element present in the molecule, the basic ID spectra become crowded with overlapping 
signals to an extent where analysis is impossible. Therefore, multidimensional (2, 3 or 4D) 
experiments have been devised to deal with this problem. To facilitate these experiments, it 

1 O 1 r 

is desirable to isotopically label the protein with C and N because the predominant 
naturally occurring isotope 12 C is not NMR-active, whereas the nuclear quadrupole moment 
of the predominant naturally occurring N isotope prevents high resolution information to 
be obtained from this nitrogen isotope. The most important method used for structure 
determination of proteins utilizes NOE experiments to measure distances between pairs of 
atoms within the molecule. Subsequently, the obtained distances are used to generate a 3D 
structure of the molecule using a computer program. 

See also 

In vivo magnetic resonance spectroscopy 

Low field NMR 

Magnetic Resonance Imaging 

Nuclear Magnetic Resonance 

NMR spectra database 

NMR tube - includes sample preparation 

Protein nuclear magnetic resonance spectroscopy 
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• Protein NMR- A Practical Guide (http://www.protein-nmr.org.uk) Practical guide to 
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Spectroscopy" (reprinted at University of Cambridge). University of California, Irvine. 
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• The Basics of NMR (http://www.cis.rit.edu/htbooks/nmr/) - A non-technical overview 
of NMR theory, equipment, and techniques by Dr. Joseph Hornak, Professor of Chemistry 
atRIT 

• NMRWiki.ORG (http://nmrwiki.org) project, a Wiki dedicated to NMR, MRI, and EPR. 

• NMR spectroscopy for organic chemistry (http://www.organicworldwide.net/nmr. 
html) 

• The Spectral Game (http://spectralgame.com) NMR spectroscopy game. 

Free NMR processing, analysis and simulation software 

• WINDNMR-Pro (http://www.chem.wisc.edu/areas/reich/plt/windnmr.htm) - 
simulation software for interactive calculation of first and second-order spin-coupled 
multiplets and a variety of DNMR lineshapes. 

• CARA (http://www.nmr.ch) - resonance assignment software developed at the Wiithrich 
group 
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• NMRShiftDB (http://www.nmrshiftdb.org) - open database and NMR prediction 
website 

• Spinworks (http://www.umanitoba.ca/chemistry/nmr/spinworks/) 
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1. REDIRECTNuclear magnetic resonance 

NIR 



The three-letter acronym NIR may stand for 

• Near Infrared, infrared radiation of a wave length near that of red 

• Northern Ireland, a region of the United Kingdom 

• Northern Ireland Railways 

• National Identity Register 

• National Internet Registry 

• Nir, one of two cities with that name in Iran 

• Nir, a Hebrew word meaning 'a plowed field' 

• Nir, a common Hebrew name 



Spectral 



Spectral imaging is a branch of spectroscopy in which a complete spectrum or some 
spectral information (such as the Doppler shift or Zeeman splitting of a spectral line) is 
collected at every location in an image plane. Applications include astronomy, solar physics, 
analysis of plasmas in nuclear fusion experiments, planetology, and Earth remote sensing. 

This is often referred to as hyperspectral imaging. 



See also 

• Dopplergraph 
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Hyperspectral 

Hyperspectral imaging collects and processes information from across the 
electromagnetic spectrum. Unlike the human eye, which just sees visible light, 
hyperspectral imaging is more like the eyes of the mantis shrimp, which can see visible 
light as well as from the ultraviolet to infrared. Hyperspectral capabilities enable the 
mantis shrimp to recognize different types of coral, prey, or predators, all which may 
appear as the same color to the human eye. 

Humans build sensors and processing systems to provide the same type of capability for 
application in agriculture, mineralogy, physics, and surveillance. Hyperspectral sensors 
look at objects using a vast portion of the electromagnetic spectrum. Certain objects leave 
unique 'fingerprints' across the electromagnetic spectrum. These 'fingerprints' are known 
as spectral signatures and enable identification of the materials that make up a scanned 
object. For example, having the spectral signature for oil helps mineralogists find new oil 
fields. 

Acquisition and Analysis 

Hyperspectral sensors collect information as a set of 
'images'. Each image represents a range of the 
electromagnetic spectrum and is also known as a 
spectral band. These 'images' are then combined and 
form a three dimensional hyperspectral cube for 
processing and analysis. 

Hyperspectral cubes are generated from airborne 
sensors like the NASA's Airborne Visible/Infrared 
Imaging Spectrometer (AVIRIS), or from satellites like 
NASA's Hyperion. J However, for many development 
and validation studies handheld sensors are used. ] 

The precision of these sensors is typically measured in 
spectral resolution, which is the width of each band of 
the spectrum that is captured. If the scanner picks up on a large number of fairly narrow 
frequency bands, it is possible to identify objects even if said objects are only captured in a 
handful of pixels. However, spatial resolution is a factor in addition to spectral resolution. If 
the pixels are too large, then multiple objects are captured in the same pixel and become 
difficult to identify. If the pixels are too small, then the energy captured by each sensor-cell 
is low, and the decreased signal-to-noise ratio reduces the reliability of measured features. 

MicroMSI, Opticks and Envi are three remote sensing applications that support the 
processing and analysis of hyperspectral data. The acquisition and processing of 
hyperspectral images is also referred to as imaging spectroscopy. 
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Differences between Hyperspectral and Multispectral 



Hyperspectral Imaging is part of a class of techniques 
commonly referred to as spectral imaging or spectral 
analysis. Hyperspectral Imaging is related to 
multispectral imaging. The distinction between 
hyperspectral and multispectral is usually defined as 
the number of spectral bands. Multispectral data 
contains from tens to hundreds of bands. Hyperspectral 
data contains hundreds to thousands of bands. 
However, hyperspectral imaging may be best defined 
by the manner in which the data is collected. 
Hyperspectral data is a set of contiguous bands (usually 
by one sensor). Multispectral is a set of optimally 
chosen spectral bands that are typically not contiguous 
and can be collected from multiple sensors. 



Multispectral/ 

>erspectral Comparison 




Hyperspectral and Multispectral 
Differences. 



Applications 

Hyperspectral remote sensing is used in a wide array of real-life applications. Although 
originally developed for mining and geology (The ability of hyperspectral imaging to 
identify various minerals makes it ideal for the mining and oil industries, where it can be 
used to look for ore and oil L J L J it has now spread into fields as wide-spread as ecology and 
surveillance, as well as historical manuscript research such as the imaging of the 
Archimedes Palimpsest. This technology is continually becoming more available to the 
public, and has been used in a wide variety of ways. Organizations such as NASA and the 
USGS have catalogues of various minerals and their spectral signatures, and have posted 
them online to make them readily available for researchers. 

Agriculture 

Although the costs of acquiring hyperspectral images is typically high, for specific crops 
and in specific climates hyperspectral remote sensing is used more and more for 
monitoring the development and health of crops. In Australia work is underway to use 
imaging spectrometers to detect grape variety, and develop an early warning system for 
disease outbreaks. J Furthermore work is underway to use hyperspectral data to detect the 
chemical composition of plants [5] which can be used to detect the nutrient and water status 
of wheat in irrigated systems^ J 



Mineralogy 

The original field of development for hyperspectral remote sensing, hyperspectral sensing 
of minerals is now well developed. Many minerals can be identified from images, and their 
relation to the presence of valuable minerals such as gold and diamonds is well understood. 
Currently the move is towards understanding the relation between oil and gas leakages 
from pipelines and natural wells; their effect on the vegetation and the spectral signatures. 
Recent work includes the PhD dissertations of Werff 7] and Noomen [ ] . 
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Physics 

Physicists use an electron microscopy technique that involves microanalysis using either 
Energy dispersive X-ray spectroscopy (EDS), Electron energy loss spectroscopy (EELS), 
Infrared Spectroscopy(IR), Raman Spectroscopy, or cathodoluminescence (CL) 
spectroscopy, in which the entire spectrum measured at each point is recorded. EELS 
hyperspectral imaging is performed in a scanning transmission electron microscope 
(STEM); EDS and CL mapping can be performed in STEM as well, or in a scanning electron 
microscope or electron probe microanalyzer (EPMA). Often, multiple techniques (EDS, 
EELS, CL) are used simultaneously. 

In a "normal" mapping experiment, an image of the sample will be made that is simply the 
intensity of a particular emission mapped in an XY raster. For example, an EDS map could 
be made of a steel sample, in which iron x-ray intensity is used for the intensity grayscale of 
the image. Dark areas in the image would indicate not-iron-bearing impurities. This could 
potentially give misleading results; if the steel contained tungsten inclusions, for example, 
the high atomic number of tungsten could result in bremsstrahlung radiation that made the 
iron-free areas appear to be rich in iron. 

By hyperspectral mapping, instead, the entire spectrum at each mapping point is acquired, 
and a quantitative analysis can be performed by computer post-processing of the data, and 
a quantitative map of iron content produced. This would show which areas contained no 
iron, despite the anomalous x-ray counts caused by bremsstrahlung. Because EELS 
core-loss edges are small signals on top of a large background, hyperspectral imaging 
allows large improvements to the quality of EELS chemical maps. 

Similarly, in CL mapping, small shifts in the peak emission energy could be mapped, which 
would give information regarding slight chemical composition changes or changes in the 
stress state of a sample. 

Surveillance 

Hyperspectral surveillance is the implementation of hyperspectral scanning technology for 
surveillance purposes. Hyperspectral imaging is particularly useful in military surveillance 
because of measures that military entities now take to avoid airborne surveillance. Airborne 
surveillance has been in effect since soldiers used tethered balloons to spy on troops during 
the American Civil War, and since that time we have learned not only to hide from the 
naked eye, but to mask our heat signature to blend in to the surroundings and avoid 
infrared scanning, as well. The idea that drives hyperspectral surveillance is that 
hyperspectral scanning draws information from such a large portion of the light spectrum 
that any given object should have unique spectral signature in at least a few of the many 
bands that get scanned. 1] 
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Advantages and Disadvantages 

The primary advantages to hyperspectral imaging is that, because an entire spectrum is 
acquired at each point, the operator needs no a priori knowledge of the sample, and 
post-processing allows all available information from the dataset to be mined. 

The primary disadvantages are cost and complexity. Fast computers, sensitive detectors, 
and large data storage capacities are needed for analyzing hyperspectral data. Significant 
data storage capacity is necessary since hyperspectral cubes are large multi-dimensional 
datasets, potentially exceeding hundreds of megabytes. All of these factors greatly increase 
the cost of acquiring and processing hyperspectral data. Also, one of the hurdles that 
researchers have had to face is finding ways to program hyperspectral satellites to sort 
through data on their own and transmit only the most important images, as both 
transmission and storage of that much data could prove difficult and costly. J As a 
relatively new analytical technique, the full potential of hyperspectral imaging has not yet 
been realized. 

See also 

• Airborne Real-time Cueing Hyperspectral Enhanced Reconnaissance 

• Full Spectral Imaging 

• Multi-spectral image 

• Chemical imaging 

• Remote Sensing 

• Sensor fusion 

External Links 

ITT Visual Information Solutions - ENVI Hyperspectral Image Processing Software L J 
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Chemical imaging 

Chemical imaging is the simultaneous measurement of spectra (chemical information) 
and images or pictures (spatial information)^ [2] The technique is most often applied to 
either solid or gel samples, and has applications in chemistry, biology^ J L J L J L J L J L J , 
medicine [9] [10] , pharmacy [11] (see also for example: Chemical Imaging Without Dyeing 
[ ] ), food science, biotechnology^ ] [ , agriculture and industry (see for example:NIR 
Chemical Imaging in Pharmaceutical Industry [15] and Pharmaceutical Process Analytical 
Technology: L J ). NIR, IR and Raman chemical imaging is also referred to as hyperspectral, 
spectroscopic, spectral or multispectral imaging (also see microspectroscopy). However, 
other ultra-sensitive and selective, chemical imaging techniques are also in use that involve 
either UV-visible or fluorescence microspectroscopy. Chemical imaging techniques can be 
used to analyze samples of all sizes, from the single molecule^ J L J to the cellular level in 
biology and medicine [19] [20] [21] , and to images of planetary systems in astronomy, but 
different instrumentation is employed for making observations on such widely different 
systems. 

Chemical imaging instrumentation is composed of three components: a radiation source to 
illuminate the sample, a spectrally selective element, and usually a detector array (the 
camera) to collect the images. When many stacked spectral channels (wavelengths) are 
collected for different locations of the microspectrometer focus on a line or planar array in 
the focal plane, the data is called hyperspectral; fewer wavelength data sets are called 
multispectral. The data format is called a hypercube. The data set may be visualized as a 
three-dimensional block of data spanning two spatial dimensions (x and y), with a series of 
wavelengths (lambda) making up the third (spectral) axis. The hypercube can be visually 
and mathematically treated as a series of spectrally resolved images (each image plane 
corresponding to the image at one wavelength) or a series of spatially resolved spectra. The 
analyst may choose to view the spectrum measured at a particular spatial location; this is 
useful for chemical identification. Alternatively, selecting an image plane at a particular 
wavelength can highlight the spatial distribution of sample components, provided that their 
spectral signatures are different at the selected wavelength. 

Many materials, both manufactured and naturally occurring, derive their functionality from 
the spatial distribution of sample components. For example, extended release 
pharmaceutical formulations can be achieved by using a coating that acts as a barrier layer. 
The release of active ingredient is controlled by the presence of this barrier, and 
imperfections in the coating, such as discontinuities, may result in altered performance. In 
the semi-conductor industry, irregularities or contaminants in silicon wafers or printed 
micro-circuits can lead to failure of these components. The functionality of biological 
systems is also dependent upon chemical gradients - a single cell, tissue, and even whole 
organs function because of the very specific arrangement of components. It has been shown 
that even small changes in chemical composition and distribution may be an early indicator 
of disease. 

Any material that depends on chemical gradients for functionality may be amenable to 
study by an analytical technique that couples spatial and chemical characterization. To 
efficiently and effectively design and manufacture such materials, the 'what' and the 
'where' must both be measured. The demand for this type of analysis is increasing as 
manufactured materials become more complex. Chemical imaging techniques not only 
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permit visualization of the spatially resolved chemical information that is critical to 
understanding modern manufactured products, but it is also a non-destructive technique so 
that samples are preserved for further testing. 

History 

Commercially available laboratory-based chemical imaging systems emerged in the early 
1990s (ref. 1-5). In addition to economic factors, such as the need for sophisticated 
electronics and extremely high-end computers, a significant barrier to commercialization of 
infrared imaging was that the focal plane array (FPA) needed to read IR images were not 
readily available as commercial items. As high-speed electronics and sophisticated 
computers became more commonplace, and infrared cameras became readily commercially 
available, laboratory chemical imaging systems were introduced. 

Initially used for novel research in specialized laboratories, chemical imaging became a 
more commonplace analytical technique used for general R&D, quality assurance (QA) and 
quality control (QC) in less than a decade. The rapid acceptance of the technology in a 
variety of industries (pharmaceutical, polymers, semiconductors, security, forensics and 
agriculture) rests in the wealth of information characterizing both chemical composition 
and morphology. The parallel nature of chemical imaging data makes it possible to analyze 
multiple samples simultaneously for applications that require high throughput analysis in 
addition to characterizing a single sample. 

Principles 

Chemical imaging shares the fundamentals of vibrational spectroscopic techniques, but 
provides additional information by way of the simultaneous acquisition of spatially resolved 
spectra. It combines the advantages of digital imaging with the attributes of spectroscopic 
measurements. Briefly, vibrational spectroscopy measures the interaction of light with 
matter. Photons that interact with a sample are either absorbed or scattered; photons of 
specific energy are absorbed, and the pattern of absorption provides information, or a 
fingerprint, on the molecules that are present in the sample. 

On the other hand, in terms of the observation setup, chemical imaging can be carried out 
in one of the following modes: (optical) absorption, emission (fluorescence), (optical) 
transmission or scattering (Raman). A consensus currently exists that the fluorescence 
(emission) and Raman scattering modes are the most sensitive and powerful, but also the 
most expensive. 

In a transmission measurement, the radiation goes through a sample and is measured by a 
detector placed on the far side of the sample. The energy transferred from the incoming 
radiation to the molecule(s) can be calculated as the difference between the quantity of 
photons that were emitted by the source and the quantity that is measured by the detector. 
In a diffuse reflectance measurement, the same energy difference measurement is made, 
but the source and detector are located on the same side of the sample, and the photons 
that are measured have re-emerged from the illuminated side of the sample rather than 
passed through it. The energy may be measured at one or multiple wavelengths; when a 
series of measurements are made, the response curve is called a spectrum. 

A key element in acquiring spectra is that the radiation must somehow be energy selected - 
either before or after interacting with the sample. Wavelength selection can be 
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accomplished with a fixed filter, tunable filter, spectrograph, an interferometer, or other 
devices. For a fixed filter approach, it is not efficient to collect a significant number of 
wavelengths, and multispectral data are usually collected. Interferometer-based chemical 
imaging requires that entire spectral ranges be collected, and therefore results in 
hyperspectral data. Tunable filters have the flexibility to provide either multi- or 
hyperspectral data, depending on analytical requirements. 

Spectra may be measured one point at a time using a single element detector (single-point 
mapping), as a line-image using a linear array detector (typically 16 to 28 pixels) (linear 
array mapping), or as a two-dimensional image using a Focal Plane Array (FPA) (typically 
256 to 16,384 pixels) (FPA imaging). For single-point the sample is moved in the x and y 
directions point-by-point using a computer-controlled stage. With linear array mapping, the 
sample is moved line-by-line with a computer-controlled stage. FPA imaging data are 
collected with a two-dimensional FPA detector, hence capturing the full desired 
field-of-view at one time for each individual wavelength, without having to move the 
sample. FPA imaging, with its ability to collected tens of thousands of spectra 
simultaneously is orders of magnitude faster than linear arrays which are can typically 
collect 16 to 28 spectra simultaneously, which are in turn much faster than single-point 
mapping. 

Terminology 

Some words common in spectroscopy, optical microscopy and photography have been 
adapted or their scope modified for their use in chemical imaging. They include: resolution, 
field of view and magnification. There are two types of resolution in chemical imaging. The 
spectral resolution refers to the ability to resolve small energy differences; it applies to the 
spectral axis. The spatial resolution is the minimum distance between two objects that is 
required for them to be detected as distinct objects. The spatial resolution is influenced by 
the field of view, a physical measure of the size of the area probed by the analysis. In 
imaging, the field of view is a product of the magnification and the number of pixels in the 
detector array. The magnification is a ratio of the physical area of the detector array 
divided by the area of the sample field of view. Higher magnifications for the same detector 
image a smaller area of the sample. 

Types of vibrational chemical imaging instruments 

Chemical imaging has been implemented for mid-infrared, near-infrared spectroscopy and 
Raman spectroscopy. As with their bulk spectroscopy counterparts, each imaging technique 
has particular strengths and weaknesses, and are best suited to fulfill different needs. 

Mid-infrared chemical imaging 

Mid-infrared (MIR) spectroscopy probes fundamental molecular vibrations, which arise in 
the spectral range 2,500-25,000 nm. Commercial imaging implementations in the MIR 
region typically employ Fourier Transform Infrared (FT-IR) interferometers and the range is 
more commonly presented in wavenumber, 4,000 - 400 cm" . The MIR absorption bands 
tend to be relatively narrow and well-resolved; direct spectral interpretation is often 
possible by an experienced spectroscopist. MIR spectroscopy can distinguish subtle 
changes in chemistry and structure, and is often used for the identification of unknown 
materials. The absorptions in this spectral range are relatively strong; for this reason, 
sample presentation is important to limit the amount of material interacting with the 
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incoming radiation in the MIR region. Most data collected in this range is collected in 
transmission mode through thin sections (-10 micrometres) of material. Water is a very 
strong absorber of MIR radiation and wet samples often require advanced sampling 
procedures (such as attenuated total reflectance). Commercial instruments include point 
and line mapping, and imaging. All employ an FT-IR interferometer as wavelength selective 
element and light source. 




Remote chemical imaging of a simultaneous release of SF and 
NH at 1.5km using the FIRST imaging spectrometer 



For types of MIR microscope, see 
Microscopy#infrared microscopy. 

Atmospheric windows in the 

infrared spectrum are also 

employed to perform chemical 

imaging remotely. In these 

spectral regions the atmospheric 

gases (mainly water and CO ) 

present low absorption and allow 

infrared viewing over kilometer 

distances. Target molecules can then be viewed using the selective absorption/emission 

processes described above. An example of the chemical imaging of a simultaneous release 

of SF and NH is shown in the image. 

Near-infrared chemical imaging 

The analytical near infrared (NIR) region spans the range from approximately 700-2,500 
nm. The absorption bands seen in this spectral range arise from overtones and combination 
bands of O-H, N-H, C-H and S-H stretching and bending vibrations. Absorption is one to two 
orders of magnitude smaller in the NIR compared to the MIR; this phenomenon eliminates 
the need for extensive sample preparation. Thick and thin samples can be analyzed without 
any sample preparation, it is possible to acquire NIR chemical images through some 
packaging materials, and the technique can be used to examine hydrated samples, within 
limits. Intact samples can be imaged in transmittance or diffuse reflectance. 

The lineshapes for overtone and combination bands tend to be much broader and more 
overlapped than for the fundamental bands seen in the MIR. Often, multivariate methods 
are used to separate spectral signatures of sample components. NIR chemical imaging is 
particularly useful for performing rapid, reproducible and non-destructive analyses of 
known materials^ J L . NIR imaging instruments are typically based on one of two 
platforms: imaging using a tunable filter and broad band illumination, and line mapping 
employing an FT-IR interferometer as the wavelength filter and light source. 

Raman chemical imaging 

The Raman shift chemical imaging spectral range spans from approximately 50 to 4,000 
cm" ; the actual spectral range over which a particular Raman measurement is made is a 
function of the laser excitation frequency. The basic principle behind Raman spectroscopy 
differs from the MIR and NIR in that the x-axis of the Raman spectrum is measured as a 
function of energy shift (in cm" ) relative to the frequency of the laser used as the source of 
radiation. Briefly, the Raman spectrum arises from inelastic scattering of incident photons, 
which requires a change in polarizability with vibration, as opposed to infrared absorption, 
which requires a change in dipole moment with vibration. The end result is spectral 
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information that is similar and in many cases complementary to the MIR. The Raman effect 
is weak - only about one in 10 photons incident to the sample undergoes Raman scattering. 
Both organic and inorganic materials possess a Raman spectrum; they generally produce 
sharp bands that are chemically specific. Fluorescence is a competing phenomenon and, 
depending on the sample, can overwhelm the Raman signal, for both bulk spectroscopy and 
imaging implementations. 

Raman chemical imaging requires little or no sample preparation. However, physical 
sample sectioning may be used to expose the surface of interest, with care taken to obtain a 
surface that is as flat as possible. The conditions required for a particular measurement 
dictate the level of invasiveness of the technique, and samples that are sensitive to high 
power laser radiation may be damaged during analysis. It is relatively insensitive to the 
presence of water in the sample and is therefore useful for imaging samples that contain 
water such as biological material. 

Fluorescence imaging (visible and NIR) 

This emission microspectroscopy mode is the most sensitive in both visible and FT-NIR 
microspectroscopy, and has therefore numerous biomedical, biotechnological and 
agricultural applications. There are several powerful, highly specific and sensitive 
fluorescence techniques that are currently in use, or still being developed; among the 
former are FLIM, FRAP, FRET and FLIM-FRET; among the latter are NIR fluorescence and 
probe-sensitivity enhanced NIR fluorescence microspectroscopy and nanospectroscopy 
techniques (see "Further reading" section). 

Sampling and samples 

The value of imaging lies in the ability to resolve spatial heterogeneities in solid-state or 
gel/gel-like samples. Imaging a liquid or even a suspension has limited use as constant 
sample motion serves to average spatial information, unless ultra-fast recording techniques 
are employed as in fluorescence correlation microspectroscopy or FLIM obsevations where 
a single molecule may be monitored at extremely high (photon) detection speed. 
High-throughput experiments (such as imaging multi-well plates) of liquid samples can 
however provide valuable information. In this case, the parallel acquisition of thousands of 
spectra can be used to compare differences between samples, rather than the more 
common implementation of exploring spatial heterogeneity within a single sample. 

Similarly, there is no benefit in imaging a truly homogeneous sample, as a single point 
spectrometer will generate the same spectral information. Of course the definition of 
homogeneity is dependent on the spatial resolution of the imaging system employed. For 
MIR imaging, where wavelengths span from 3-10 micrometres, objects on the order of 5 
micrometres may theoretically be resolved. The sampled areas are limited by current 
experimental implementations because illumination is provided by the interferometer. 
Raman imaging may be able to resolve particles less than 1 micrometre in size, but the 
sample area that can be illuminated is severely limited. With Raman imaging, it is 
considered impractical to image large areas and, consequently, large samples. FT-NIR 
chemical/hyperspectral imaging usually resolves only larger objects (>10 micrometres), 
and is better suited for large samples because illumination sources are readily available. 
However, FT-NIR microspectroscopy was recently reported to be capable of about 1.2 
micron (micrometer) resolution in biological samples^ J Furthermore, two-photon 



Chemical imaging 130 

excitation FCS experiments were reported to have attained 15 nanometer resolution on 
biomembrane thin films with a special coincidence photon-counting setup. 

Detection limit 

The concept of the detection limit for chemical imaging is quite different than for bulk 
spectroscopy, as it depends on the sample itself. Because a bulk spectrum represents an 
average of the materials present, the spectral signatures of trace components are simply 
overwhelmed by dilution. In imaging however, each pixel has a corresponding spectrum. If 
the physical size of the trace contaminant is on the order of the pixel size imaged on the 
sample, its spectral signature will likely be detectable. If however, the trace component is 
dispersed homogeneously (relative to pixel image size) throughout a sample, it will not be 
detectable. Therefore, detection limits of chemical imaging techniques are strongly 
influenced by particle size, the chemical and spatial heterogeneity of the sample, and the 
spatial resolution of the image. 

Data analysis 

Data analysis methods for chemical imaging data sets typically employ mathematical 
algorithms common to single point spectroscopy or to image analysis. The reasoning is that 
the spectrum acquired by each detector is equivalent to a single point spectrum; therefore 
pre-processing, chemometrics and pattern recognition techniques are utilized with the 
similar goal to separate chemical and physical effects and perform a qualitative or 
quantitative characterization of individual sample components. In the spatial dimension, 
each chemical image is equivalent to a digital image and standard image analysis and 
robust statistical analysis can be used for feature extraction. 

See also 

• Multispectral image 

• Microspectroscopy 

• Imaging spectroscopy 
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Raman 



Raman may refer to 

• Chandrasekhara Venkata Raman, Indian physicist and Nobel laureate, discoverer of 
Raman scattering. See also Raman spectroscopy 

• Raman Sundrum, theoretical physicist famous for the Randall-Sundrum model 

• Raman (crater), lunar crater named after C.V. Raman 

• Amphoe Raman, district in Yala Province, southern Thailand 

• Raman, Punjab, India 

• Raman, Rawalpindi, a village in Pakistani Punjab 

• Raman spectroscopy, a spectroscopic technique used in condensed matter physics and 
chemistry 

• Woorkeri Raman, Indian cricketer 

• Ramen, a Japanese dish 
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Fluorescence correlation 
spectroscopy 



Fluorescence correlation spectroscopy (FCS) is a common technique used by physicists, 
chemists, and biologists to experimentally characterize the dynamics of fluorescent species 
(e.g. single fluorescent dye molecules in nanostructured materials, autofluorescent proteins 
in living cells, etc.). Although the name indicates a specific link to fluorescence, the method 
is used today also for exploring other forms of luminescence (like reflections, luminescence 
from gold-beads or quantum dots or phosphorescent species). The "spectroscopy" in the 
name is not readily found as in common usage a spectrum is generally understood to be a 
frequency spectrum. The autocorrelation is a genuine form of spectrum, however: It is the 
time-spectrum generated from the power spectrum (via inverse fourier transform). 

Commonly, FCS is employed in the context of optical microscopy, in particular confocal or 
two photon microscopy. In these techniques light is focused on a sample and the measured 
fluorescence intensity fluctuations (due to diffusion, physical or chemical reactions, 
aggregation, etc.) are analyzed using the temporal autocorrelation. Because the measured 
property is essentially related to the magnitude and/or the amount of fluctuations, there is 
an optimum measurement regime at the level when individual species enter or exit the 
observation volume (or turn on and off in the volume). When too many entities are 
measured at the same time the overall fluctuations are small in comparison to the total 
signal and may not be resolvable - in the other direction, if the individual fluctuation-events 
are too sparse in time, one measurement may take prohibitively too long. FCS is in a way 
the fluorescent counterpart to dynamic light scattering, which uses coherent light 
scattering, instead of (incoherent) fluorescence. 

When an appropriate model is known, FCS can be used to obtain quantitative information 
such as 

• diffusion coefficients 

• hydrodynamic radii 

• average concentrations 

• kinetic chemical reaction rates 

• singlet-triplet dynamics 

Because fluorescent markers come in a variety of colors and can be specifically bound to a 
particular molecule (e.g. proteins, polymers, metal-complexes, etc.), it is possible to study 
the behavior of individual molecules (in rapid succession in composite solutions). With the 
development of sensitive detectors such as avalanche photodiodes the detection of the 
fluorescence signal coming from individual molecules in highly dilute samples has become 
practical. With this emerged the possibility to conduct FCS experiments in a wide variety of 
specimens, ranging from materials science to biology. The advent of engineered cells with 
genetically tagged proteins (like green fluorescent protein) has made FCS a common tool 
for studying molecular dynamics in living cells. 
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History 

Signal-correlation techniques have first been experimentally applied to fluorescence in 
1972 by Magde, Elson, and Webb L J , who are therefore commonly credited as the 
"inventors" of FCS. The technique was further developed in a group of papers by these and 
other authors soon after, establishing the theoretical foundations and types of 
applications. ] [ ] [ ] See Thompson (1991) [5] for a review of that period. 

Beginning in 1993 [ ] , a number of improvements in the measurement techniques-notably 
using confocal microscopy, and then two photon microscopy-to better define the 
measurement volume and reject background greatly improved the signal-to-noise and 
allowed single molecule sensitivity. J L J Since then, there has been a renewed interest in 
FCS, and as of August 2007 there has been over 3,000 papers using FCS found in Web of 
Science. See Krichevsky and Bonner J for a recent review. In addition, there has been a 
flurry of activity extending FCS in various ways, for instance to laser scanning and spinning 
disk confocal microscopy (from a stationary, single point measurement), in using 
cross-correlation (FCCS) between two fluorescent channels instead of autocorrelation, and 
in using Forster Resonance Energy Transfer (FRET) instead of fluorescence. 

Typical FCS setup 

The typical FCS setup consists of a laser line (wavelengths ranging typically from 405 - 633 
nm (cw), and from 690 - 1100 nm (pulsed)), which is reflected into a microscope objective 
by a dichroic mirror. The laser beam is focused in the sample, which contains fluorescent 
particles (molecules) in such high dilution, that only few are within the focal spot (usually 1 
- 100 molecules in one fL). When the particles cross the focal volume, they fluoresce. This 
light is collected by the same objective and, because it is red-shifted with respect to the 
excitation light it passes the dichroic reaching a detector, typically a photomultiplier tube 
or avalanche photodiode detector. The resulting electronic signal can be stored either 
directly as an intensity versus time trace to be analyzed at a later point, or, computed to 
generate the autocorrelation directly (which requires special acquisition cards). The FCS 
curve by itself only represents a time-spectrum. Conclusions on physical phenomena have 
to be extracted from there with appropriate models. The parameters of interest are found 
after fitting the autocorrelation curve to modeled functional forms. [10] The setup is shown 
in Figure 1 . 

The Measurement Volume 

The measurement volume is a convolution of illumination (excitation) and detection 
geometries, which result from the optical elements involved. The resulting volume is 
described mathematically by the point spread function (or PSF), it is essentially the image 
of a point source. The PSF is often described as an ellipsoid (with unsharp boundaries) of 
few hundred nanometers in focus diameter, and almost one micrometre along the optical 
axis. The shape varies significantly (and has a large impact on the resulting FCS curves) 
depending on the quality of the optical elements (it is crucial to avoid astigmatism and to 
check the real shape of the PSF on the instrument). In the case of confocal microscopy, and 
for small pinholes (around one Airy unit), the PSF is well approximated by Gaussians: 

PSF(t, z) = I oe - 2r2 /<e- 2 ^ 
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where Iqis the peak intensity, r and z are radial and axial position, and ^'^and ^'zare the 
radial and axial radii, and w * ^ ^xy. This Gaussian form is assumed in deriving the 
functional form of the autocorrelation. 

Typically ^yis 200-300 nm, and ^Ms 2-6 times larger. J One common way of calibrating 
the measurement volume parameters is to perform FCS on a species with known diffusion 
coefficient and concentration (see below). Diffusion coefficients for common fluorophores in 
water are given in a later section. 

The Gaussian approximation works to varying degrees depending on the optical details, and 
corrections can sometimes be applied to offset the errors in approximation. 12] 

Autocorrelation Function 

The (temporal) autocorrelation function is the correlation of a time series with itself shifted 
by time ~, as a function of t : 

(SI(t)6I(t + T)} (I(t)I(t + r)) _ 
7{l) - {I(t))z </(*))■ 

where 5I(t) = I(t) — (I(t)) is the deviation from the mean intensity. The normalization 
(denominator) here is the most commonly used for FCS, because then the correlation at 
r = 0, G(0), is related to the average number of particles in the measurement volume. 

Interpreting the Autocorrelation Function 

To extract quantities of interest, the autocorrelation data can be fitted, typically using a 
nonlinear least squares algorithm. The fit's functional form depends on the type of 
dynamics (and the optical geometry in question). 

Normal Diffusion 

The fluorescent particles used in FCS are small and thus experience thermal motions in 
solution. The simplest FCS experiment is thus normal 3D diffusion, for which the 
autocorrelation is: 

°M - G <°> (TTC^JKiWa^ + G(oo) 

where & = ^z/^xy is the ratio of axial to radial e _2 radii of the measurement volume, and 
^.Dis the characteristic residence time. This form was derived assuming a Gaussian 
measurement volume. Typically, the fit would have three free parameters--G(O), G(oc) / and 
t d --from which the diffusion coefficient and fluorophore concentration can be obtained. 
With the normalization used in the previous section, G(0) gives the mean number of 
diffusers in the volume <N>, or equivalently-with knowledge of the observation volume 
size-the mean concentration: 

1 1 

G{0) = TN^ = V eff <C>> 

where the effective volume is found from integrating the Gaussian form of the 
measurement volume and is given by: 

t^ J/2. 2 . , 

t d gives the diffusion coefficient: D = u: x „/4t"£). 
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Anomalous diffusion 

If the diffusing particles are hindered by obstacles or pushed by a force (molecular motors, 
flow, etc.) the dynamics is often not sufficiently well-described by the normal diffusion 
model, where the mean squared displacement (MSD) grows linearly with time. Instead the 
diffusion may be better described as anomalous diffusion, where the temporal dependenc of 
the MSD is non-linear as in the power-law: 

MSD = 6D a t a 

where D a is an anomalous diffusion coefficient. "Anomalous diffusion" commonly refers 

only to this very generic model, and not the many other possibilities that might be 

described as anomalous. Also, a power law is, in a strict sense, the expected form only for a 

narrow range of rigorously defined systems, for instance when the distribution of obstacles 

is fractal. Nonetheless a power law can be a useful approximation for a wider range of 

systems. 

The FCS autocorrelation function for anomalous diffusion is: 

0(r) - G(Q) fl + (T/TD)a){l \ a - a(T/Tp)a)1/2 + C(co) , 

where the anomalous exponent ft is the same as above, and becomes a free parameter in 
the fitting. 

Using FCS, the anomalous exponent has been shown to be an indication of the degree of 
molecular crowding (it is less than one and smaller for greater degrees of crowdings . 

Polydisperse diffusion 

If there are diffusing particles with different sizes (diffusion coefficients), it is common to fit 
to a function that is the sum of single component forms: 

where the sum is over the number different sizes of particle, indexed by i, and a % gives the 
weighting, which is related to the quantum yield and concentration of each type. This 
introduces new parameters, which makes the fitting more difficult as a higher dimensional 
space must be searched. Nonlinear least square fitting typically becomes unstable with 
even a small number of r °- i s. A more robust fitting scheme, especially useful for 
polydisperse samples, is the Maximum Entropy Method^ J . 

Diffusion with flow 

With diffusion together with a uniform flow with velocity i-in the lateral direction, the 
autocorrelation is 1 - J : 

G(t)=G(0) wiL-2^ ^/2 Xe M-(r/<fx [ l+g(oo) 

(1 + {t/t d ))(1 + a 2 {t/t d ))V 2 1 + rJT D 

where t v = uj xy /v is the average residence time if there is only a flow (no diffusion). 
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Chemical relaxation 

A wide range of possible FCS experiments involve chemical reactions that continually 
fluctuate from equilibrium because of thermal motions (and then "relax"). In contrast to 
diffusion, which is also a relaxation process, the fluctuations cause changes between states 
of different energies. One very simple system showing chemical relaxation would be a 
stationary binding site in the measurement volume, where particles only produce signal 
when bound (e.g. by FRET, or if the diffusion time is much faster than the sampling 
interval). In this case the autocorrelation is: 

G(t) = G(0) exp(-r/r s ) + G(oo) 
where 

r B = (k on + k ff) 

is the relaxation time and depends on the reaction kinetics (on and off rates), and: 

is related to the equilibrium constant K. 

Most systems with chemical relaxation also show measureable diffusion as well, and the 
autocorrelation function will depend on the details of the system. If the diffusion and 
chemical reaction are decoupled, the combined autocorrelation is the product of the 
chemical and diffusive autocorrelations. 

Triplet State Correction 

The autocorrelations above assume that the fluctuations are not due to changes in the 
fluorescent properties of the particles. However, for the majority of (bio)organic 
fluorophores-e.g. green fluorescent protein, rhodamine, Cy3 and Alexa Fluor dyes-some 
fraction of illuminated particles are excited to a triplet state (or other non-radiative 
decaying states) and then do not emit photons for a characteristic relaxation time r f. 
Typically T Fis on the order of microseconds, which is usually smaller than the dynamics of 
interest (e.g. T n ) but large enough to be measured. A multiplicative term is added to the 
autocorrelation account for the triplet state. For normal diffusion: 

1-F + Fe.- T/TF 1 
G(r) = G(Q) —— — — — — - + G(oo) 

where Fis the fraction of particles that have entered the triplet state and T Pis the 
corresponding triplet state relaxation time. If the dynamics of interest are much slower 
than the triplet state relaxation, the short time component of the autocorrelation can simply 
be truncated and the triplet term is unnecessary. 

Common fluorescent probes 

The fluorescent species used in FCS is typically a biomolecule of interest that has been 
tagged with a fluorophore (using immunohistochemistry for instance), or is a naked 
fluorophore that is used to probe some environment of interest (e.g. the cytoskeleton of a 
cell). The following table gives diffusion coefficients of some common fluorophores in water 
at room temperature, and their excitation wavelengths. 



Fluorescent dye „ m :> i Excitation 

y D (*10 10 m 2 s 1 ) _ _ , . 

wavelength (nm) 



Reference 
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Rhodamine 6G 


2.8, 3.0, 4.14 ± 0.05 @ 25.00 °C 


514 


[16] [17] [18] 


Rhodamine 110 


2.7 


488 


[19] 


Tetramethyl rhodamine 


2.6 


543 




Cy3 


2.8 


543 




Cy5 


2.5, 3.7 ± 0.15 @ 25.00 °C 


633 


[20] [21] 


carboxyfluorescein 


3.2 


488 




Alexa-488 


1.96 


488 


[22] 


Atto655-maleimide 


4.07 ± 0.1 @ 25.00 °C 


663 


[23] 


Atto655-carboxylicacid 


4.26 ± 0.08(a) 25.00 °C 


663 


[24] 


2[], 7G-difluorofluorescein 
(Oregon Green488) 


4.11 ± 0.06 @ 25.00 °C 


498 


[25] 



Variations of FCS 

FCS almost always refers to the single point, single channel, temporal autocorrelation 
measurement, although the term "fluorescence correlation spectroscopy" out of its 
historical scientific context implies no such restriction. FCS has been extended in a number 
of variations by different researchers, with each extension generating another name 
(usually an acronym). 



Fluorescence Cross-Correlation Spectroscopy (FCCS) 

FCS is sometimes used to study molecular interactions using differences in diffusion times 
(e.g. the product of an association reaction will be larger and thus have larger diffusion 
times than the reactants individually); however, FCS is relatively insensitive to molecular 
mass as can be seen from the following equation relating molecular mass to the diffusion 
time of globular particles (e.g. proteins): 



*D = 



2kT 



iMy 



where 7 ?is the viscosity of the sample and Mis the molecular mass of the fluorescent 
species. In practice, the diffusion times need to be sufficiently different-a factor of at least 
1.6-which means the molecular masses must differ by a factor of 4. J Dual color 
fluorescence cross-correlation spectroscopy (FCCS) measures interactions by 
cross-correlating two or more fluorescent channels (one channel for each reactant), which 
distinguishes interactions more sensitively than FCS, particularly when the mass change in 
the reaction is small. 
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Two- and three- photon FCS excitation 

Several advantages in both spatial resolution and minimizing photodamage/photobleaching 
in organic and/or biological samples are obtained by two-photon or three-photon excitation 

FC g[27] [28] [29] [30] [31] 

FRET-FCS 

Another FCS based approach to studying molecular interactions uses fluorescence 
resonance energy transfer (FRET) instead of fluorescence, and is called FRET-FCS. J With 
FRET, there are two types of probes, as with FCCS; however, there is only one channel and 
light is only detected when the two probes are very close-close enough to ensure an 
interaction. The FRET signal is weaker than with fluorescence, but has the advantage that 
there is only signal during a reaction (aside from autofluorescence). 

Image Correlation Spectroscopy (ICS) 

When the motion is slow (in biology, for example, diffusion in a membrane), getting 
adequate statistics from a single-point FCS experiment may take a prohibitively long time. 
More data can be obtained by performing the experiment in multiple spatial points in 
parallel, using a laser scanning confocal microscope. This approach has been called Image 
Correlation Spectroscopy (ICS) [ ] . The measurements can then be averaged together. 

Another variation of ICS performs a spatial autocorrelation on images, which gives 
information about the concentration of particles [34] . The correlation is then averaged in 
time. 

A natural extension of the temporal and spatial correlation versions is spatio-temporal ICS 
(STICS) [35] . In STICS there is no explicit averaging in space or time (only the averaging 
inherent in correlation). In systems with non-isotropic motion (e.g. directed flow, 
asymmetric diffusion), STICS can extract the directional information. A variation that is 
closely related to STICS (by the Fourier transform) is k-space Image Correlation 
Spectroscopy (kICS). [36] 

There are cross-correlation versions of ICS as well. 33] 

Scanning FCS variations 

Some variations of FCS are only applicable to serial scanning laser microscopes. Image 
Correlation Spectroscopy and its variations all were implemented on a scanning confocal or 
scanning two photon microscope, but transfer to other microscopes, like a spinning disk 
confocal microscope. Raster ICS (RICS) [37] , and position sensitive FCS (PSFCS) [38] 
incorporate the time delay between parts of the image scan into the analysis. Also, low 
dimensional scans (e.g. a circular ring) [ ] -only possible on a scanning system-can access 
time scales between single point and full image measurements. Scanning path has also 
been made to adaptively follow particles. ] 
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Spinning disk FCS, and spatial mapping 

Any of the image correlation spectroscopy methods can also be performed on a spinning 
disk confocal microscope, which in practice can obtain faster imaging speeds compared to a 
laser scanning confocal microscope. This approach has recently been applied to diffusion in 
a spatially varying complex environment, producing a pixel resolution map of diffusion 
coefficient. J . The spatial mapping of diffusion with FCS has subsequently been extended 
to TIRF system. ] Spatial mapping of dynamics using correlation techniques had been 
applied before, but only at sparse points^ ] or at coarse resolution^ ] . 

Total internal reflection FCS 

Total internal reflection fluorescence (TIRF) is a microscopy approach that is only sensitive 
to a thin layer near the surface of a coverslip, which greatly minimizes background 
fluorscence. FCS has been extended to that type of microscope, and is called TIR-FCS [ ] . 
Because the fluorescence intensity in TIRF falls off exponentially with distance from the 
coverslip (instead of as a Gaussian with a confocal), the autocorrelation function is 
different. 

Other fluorescent dynamical approaches 

There are two main non-correlation alternatives to FCS that are widely used to study the 
dynamics of fluorescent species. 

Fluorescence recovery after photobleaching (FRAP) 

In FRAP, a region is briefly exposed to intense light, irrecoverably photobleaching 
fluorophores, and the fluorescence recovery due to diffusion of nearby (non-bleached) 
fluorophores is imaged. A primary advantage of FRAP over FCS is the ease of interpreting 
qualitative experiments common in cell biology. Differences between cell lines, or regions 
of a cell, or before and after application of drug, can often be characterized by simple 
inspection of movies. FCS experiments require a level of processing and are more sensitive 
to potentially confounding influences like: rotational diffusion, vibrations, photobleaching, 
dependence on illumination and fluorescence color, inadequate statistics, etc. It is much 
easier to change the measurement volume in FRAP, which allows greater control. In 
practice, the volumes are typically larger than in FCS. While FRAP experiments are 
typically more qualitative, some researchers are studying FRAP quantitatively and including 
binding dynamics. ] A disadvantage of FRAP in cell biology is the free radical perturbation 
of the cell caused by the photobleaching. It is also less versatile, as it cannot measure 
concentration or rotational diffusion, or co-localization. FRAP requires a significantly higher 
concentration of fluorophores than FCS. 

Particle tracking 

In particle tracking, the trajectories of a set of particles are measured, typically by applying 
particle tracking algorithms to movies. [46] Particle tracking has the advantage that all the 
dynamical information is maintained in the measurement, unlike FCS where correlation 
averages the dynamics to a single smooth curve. The advantage is apparent in systems 
showing complex diffusion, where directly computing the mean squared displacement 
allows straightforward comparison to normal or power law diffusion. To apply particle 
tracking, the particles have to be distinguishable and thus at lower concentration than 



Fluorescence correlation spectroscopy 142 

required of FCS. Also, particle tracking is more sensitive to noise, which can sometimes 
affect the results unpredictably. 
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See also 

• Confocal microscopy 

• Fluorescence cross-correlation spectroscopy 

• FRET 

• Dynamic light scattering 

• Diffusion coefficient 

External links 

• Single-molecule spectroscopic methods (http://dx.doi.Org/10.1016/j.sbi.2004.09. 
004) 

• FCS Classroom (http://www.fcsxpert.com/classroom) 

Molecular models of DNA 

Molecular models of DNA structures are representations of the molecular geometry and 
topology of Deoxyribonucleic acid (DNA) molecules using one of several means, such as: 
closely packed spheres (CPK models) made of plastic, metal wires for 'skeletal models', 
graphic computations and animations by computers, artistic rendering, and so on, with the 
aim of simplifying and presenting the essential, physical and chemical, properties of DNA 
molecular structures either in vivo or in vitro. Computer molecular models also allow 
animations and molecular dynamics simulations that are very important for understanding 
how DNA functions in vivo. Thus, an old standing dynamic problem is how DNA 
"self-replication" takes place in living cells that should involve transient uncoiling of 
supercoiled DNA fibers. Although DNA consists of relatively rigid, very large elongated 
biopolymer molecules called "fibers" or chains (that are made of repeating nucleotide units 
of four basic types, attached to deoxyribose and phosphate groups), its molecular structure 
in vivo undergoes dynamic configuration changes that involve dynamically attached water 
molecules and ions. Supercoiling, packing with histones in chromosome structures, and 
other such supramolecular aspects also involve in vivo DNA topology which is even more 
complex than DNA molecular geometry, thus turning molecular modeling of DNA into an 
especially challenging problem for both molecular biologists and biotechnologists. Like 
other large molecules and biopolymers, DNA often exists in multiple stable geometries (that 
is, it exhibits conformational isomerism) and configurational, quantum states which are 
close to each other in energy on the potential energy surface of the DNA molecule. Such 
geometries can also be computed, at least in principle, by employing ab initio quantum 
chemistry methods that have high accuracy for small molecules. Such quantum geometries 
define an important class of ab initio molecular models of DNA whose exploration has 
barely started. 

In an interesting twist of roles, the DNA molecule itself was proposed to 
be utilized for quantum computing. Both DNA nanostructures as well as 
DNA 'computing' biochips have been built (see biochip image at right). 

The more advanced, computer-based molecular models of DNA involve 
molecular dynamics simulations as well as quantum mechanical 
computations of vibro-rotations, delocalized molecular orbitals (MOs), 
electric dipole moments, hydrogen-bonding, and so on. 




DNA computing 
biochip: 3D 
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Importance 

From the very early stages of structural studies of DNA by X-ray 
diffraction and biochemical means, molecular models such as the 
Watson-Crick double-helix model were successfully employed to solve the 
'puzzle' of DNA structure, and also find how the latter relates to its key 
functions in living cells. The first high quality X-ray diffraction patterns 
of A-DNA were reported by Rosalind Franklin and Raymond Gosling in 
1953 L . The first calculations of the Fourier transform of an atomic helix 
were reported one year earlier by Cochran, Crick and Vand [2] , and were 
followed in 1953 by the computation of the Fourier transform of a 
coiled-coil by Crick [3] . The first reports of a double-helix molecular 
model of B-DNA structure were made by Watson and Crick in 1953 L J L . 
Last-but-not-least, Maurice F. Wilkins, A. Stokes and H.R. Wilson, 
reported the first X-ray patterns of in vivo B-DNA in partially oriented 
salmon sperm heads [6] . The development of the first correct 
double-helix molecular model of DNA by Crick and Watson may not have 

been possible without the biochemical evidence for the nucleotide base-pairing ([A— T]; 

[C-G]), or Chargaff s rules [7] [8] [9] [10] [11] [12] . 




Spinning DNA 
generic model. 



Examples of DNA molecular models 

Animated molecular models allow one to visually explore the three-dimensional (3D) 
structure of DNA. The first DNA model is a space-filling, or CPK, model of the DNA 
double-helix whereas the third is an animated wire, or skeletal type, molecular model of 
DNA. The last two DNA molecular models in this series depict quadruplex DNA L J that 
may be involved in certain cancers [ ] [ . The last figure on this panel is a molecular 
model of hydrogen bonds between water molecules in ice that are similar to those found in 
DNA. 
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Hydrogen 




• Spacefilling model or CPK model - a molecule is represented by overlapping spheres 
representing the atoms. 
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DNA Spacefilling molecular model 



Images for DNA Structure Determination from X-Ray 
Patterns 

The following images illustrate both the principles and the main steps involved in 
generating structural information from X-ray diffraction studies of oriented DNA fibers with 
the help of molecular models of DNA that are combined with crystallographic and 
mathematical analysis of the X-ray patterns. From left to right the gallery of images shows: 

• First row: 

• 1. Constructive X-ray interference, or diffraction, following Bragg's Law of X-ray 
"reflection by the crystal planes"; 

• 2. A comparison of A-DNA (crystalline) and highly hydrated B-DNA (paracrystalline) X-ray 
diffraction, and respectively, X-ray scattering patterns (courtesy of Dr. Herbert R. Wilson, 
FRS- see refs. list); 

• 3. Purified DNA precipitated in a water jug; 

• 4. The major steps involved in DNA structure determination by X-ray crystallography 
showing the important role played by molecular models of DNA structure in this iterative, 
structure-determination process; 

• Second row: 
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• 5. Photo of a modern X-ray diffractometer employed for recording X-ray patterns of DNA 
with major components: X-ray source, goniometer, sample holder, X-ray detector and/or 
plate holder; 

• 6. Illustrated animation of an X-ray goniometer; 

• 7. X-ray detector at the SLAC synchrotron facility; 

• 8. Neutron scattering facility at ISIS in UK; 

• Third and fourth rows: Molecular models of DNA structure at various scales; figure 
#11 is an actual electron micrograph of a DNA fiber bundle, presumably of a single 
bacterial chromosome loop. 
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Paracrystalline lattice models of B-DNA structures 

A paracrystalline lattice, or paracrystal, is a molecular or atomic lattice with significant 
amounts (e.g., larger than a few percent) of partial disordering of molecular 
arranegements. Limiting cases of the paracrystal model are nanostructures, such as 
glasses, liquids, etc., that may possess only local ordering and no global order. Liquid 
crystals also have paracrystalline rather than crystalline structures. 
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DNA Helix controversy in 1952 
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Highly hydrated B-DNA occurs naturally in living cells in such a paracrystalline state, which 
is a dynamic one in spite of the relatively rigid DNA double-helix stabilized by parallel 
hydrogen bonds between the nucleotide base-pairs in the two complementary, helical DNA 
chains (see figures). For simplicity most DNA molecular models ommit both water and ions 
dynamically bound to B-DNA, and are thus less useful for understanding the dynamic 
behaviors of B-DNA in vivo. The physical and mathematical analysis of X-ray 1 - J L J and 
spectroscopic data for paracrystalline B-DNA is therefore much more complicated than that 
of crystalline, A-DNA X-ray diffraction patterns. The paracrystal model is also important for 
DNA technological applications such as DNA nanotechnology. Novel techniques that 
combine X-ray diffraction of DNA with X-ray microscopy in hydrated living cells are now 
also being developed (see, for example, "Application of X-ray microscopy in the analysis of 
living hydrated cells" [18] ). 

Genomic and Biotechnology Applications of DNA molecular 
modeling 

The following gallery of images illustrates various uses of DNA molecular modeling in 
Genomics and Biotechnology research applications from DNA repair to PCR and DNA 
nanostructures; each slide contains its own explanation and/or details. The first slide 
presents an overview of DNA applications, including DNA molecular models, with emphasis 
on Genomics and Biotechnology. 

Gallery: DNA Molecular modeling applications 
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Databases for DNA molecular models and sequences 



X-ray diffraction 

• NDB ID: UD0017 Database [13] 

• X-ray Atlas -database [19] 

• PDB files of coordinates for nucleic acid structures from X-ray diffraction by NA (incl. 
DNA) crystals [20] 

• Structure factors dowloadable files in CIF format [21] 
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Neutron scattering 

• ISIS neutron source 

• ISIS pulsed neutron source:A world centre for science with neutrons & muons at 
Harwell, near Oxford, UK. [22] 

X-ray microscopy 

• Application of X-ray microscopy in the analysis of living hydrated cells L J 

Electron microscopy 

• DNA under electron microscope [ ] 

Atomic Force Microscopy (AFM) 

Two-dimensional DNA junction arrays have been visualized by Atomic Force Microscopy 
(AFM) L J . Other imaging resources for AFM/Scanning probe microscopy(SPM) can be 
freely accessed at: 

• How SPM Works [25] 

• SPM Image Gallery - AFM STM SEM MFM NSOM and more. [26] 

Gallery of AFM Images 




Sample Surface 
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Spectroscopy 

• Vibrational circular dichroism (VCD) 

• FT-NMR [18] [19] 

• NMR Atlas-database [29] 

• mmcif downloadable coordinate files of nucleic acids in solution from 2D-FT NMR data 

[30] 

• NMR constraints files for NAs in PDB format [31] 
NMR microscopy [20] 
Microwave spectroscopy 
FT-IR 

p T _ NIR [21] [22] [23] 

Spectral, Hyperspectral, and Chemical imaging) [24] [25] [26] [27] [28] [29] [30] . 

Raman spectroscopy/microscopy [ ] and CARS [ ] . 

Fluorescence correlation spectroscopy [33] [34] [35] [36] [37] [38] [39] [40] , Fluorescence 

cross-correlation spectroscopy and FRET [ ] [ ] [ . 

Confocal microscopy^ J 
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Gallery: CARS (Raman spectroscopy), Fluorescence confocal 
microscopy, and Hyperspectral imaging 
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Genomic and structural databases 

• CBS Genome Atlas Database L J — contains examples of base skews. J 

• The Z curve database of genomes — a 3-dimensional visualization and analysis tool of 
genomes [59][46] . 

• DNA and other nucleic acids' molecular models: Coordinate files of nucleic acids 
molecular structure models in PDB and CIF formats L J 



Notes 

[I] Franklin, R.E. and Gosling, R.G. recd.6 March 1953. Acta Cryst. (1953). 6, 673 The Structure of Sodium 
Thymonucleate Fibres I. The Influence of Water Content Acta Cryst. (1953). and 6, 678 The Structure of Sodium 
Thymonucleate Fibres II. The Cylindrically Symmetrical Patterson Function. 

[2] Cochran, W., Crick, F.H.C. and Vand V. 1952. The Structure of Synthetic Polypeptides. 1. The Transform of 

Atoms on a Helix. Acta Cryst. 5(5):581-586. 
[3] Crick, F.H.C. 1953a. The Fourier Transform of a Coiled-Coil., Acta Crystallographica 6(8-9):685-689. 
[4] Watson, J.D; Crick F.H.C. 1953a. Molecular Structure of Nucleic Acids- A Structure for Deoxyribose Nucleic 

Acid., Nature 171(4356):737-738. 
[5] Watson, J.D; Crick F.H.C. 1953b. The Structure of DNA., Cold Spring Harbor Symposia on Qunatitative Biology 

18:123-131. 
[6] Wilkins M.H.F., A.R. Stokes A.R. & Wilson, H.R. (1953). 

"http://www.nature.com/nature/dna50/wilkins.pdflMolecular Structure of Deoxypentose Nucleic Acids" (PDF). 

Nature 171: 738-740. doi: 10.1038/171738a0 (http://dx.doi.org/10.1038/171738a0). PMID 13054693. http:/ 

/www. nature.com/nature/dna50/wilkins.pdf. 
[7] Elson D, Chargaff E (1952). "On the deoxyribonucleic acid content of sea urchin gametes". Expehentia 8 (4): 

143-145. 
[8] Chargaff E, Lipshitz R, Green C (1952). "Composition of the deoxypentose nucleic acids of four genera of 

sea-urchin". J Biol Chem 195 (1): 155-160. PMID 14938364. 
[9] Chargaff E, Lipshitz R, Green C, Hodes ME (1951). "The composition of the deoxyribonucleic acid of salmon 

sperm". J Biol Chem 192 (1): 223-230. PMID 14917668. 
[10] Chargaff E (1951). "Some recent studies on the composition and structure of nucleic acids". J Cell Physiol 

Suppl 38 (Suppl). 

[II] Magasanik B, Vischer E, Doniger R, Elson D, Chargaff E (1950). "The separation and estimation of 
ribonucleotides in minute quantities". J Biol Chem 186 (1): 37-50. PMID 14778802. 

[12] Chargaff E (1950). "Chemical specificity of nucleic acids and mechanism of their enzymatic degradation". 

Expehentia 6 (6): 201-209. 
[13] http ://www. phy . cam. ac. uk/research/bss/molbiophysics . php 
[14] http ://planetphy sics . org/encyclopedia/TheoreticalBiophysics . html 
[15] Hosemann R., Bagchi R.N., Direct analysis of diffraction by matter, North-Holland Pubis., Amsterdam - New 

York, 1962. 
[16] Baianu, I.C. (1978). "X-ray scattering by partially disordered membrane systems.". Acta Cryst., A34 (5): 

751-753. doi: 10.1107/S0567739478001540 (http://dx.doi.org/10.1107/S0567739478001540). 
[17] Mao, Chengde; Sun, Weiqiong & Seeman, Nadrian C. (16 June 1999). "Designed Two-Dimensional DNA 

Holliday Junction Arrays Visualized by Atomic Force Microscopy". Journal of the American Chemical Society 

121 (23): 5437-5443. doi: 10.1021/ja9900398 (http://dx.doi.org/10.1021/ja9900398). ISSN 0002-7863 

(http://worldcat.org/issn/0002-7863). 
[18] (http://www.jonathanpmiller.com/Karplus.html)- obtaining dihedral angles from 3 J coupling constants 



Molecular models of DNA 157 

[19] (http://www.spectroscopynow.com/FCKeditor/UserFiles/File/specNOW/HTML files/ 

General_Karplus_Calculator.htm) Another Javascript-like NMR coupling constant to dihedral 
[20] Lee, S. C. et al., (2001). One Micrometer Resolution NMR Microscopy. J. Magn. Res., 150: 207-213. 
[21] Near Infrared Microspectroscopy, Fluorescence Microspectroscopy, Infrared Chemical Imaging and High 

Resolution Nuclear Magnetic Resonance Analysis of Soybean Seeds, Somatic Embryos and Single Cells., 

Baianu, I.C. et al. 2004., In Oil Extraction and Analysis., D. Luthria, Editor pp. 241-273, AOCS Press., 

Champaign, IL. 
[22] Single Cancer Cell Detection by Near Infrared Microspectroscopy, Infrared Chemical Imaging and 

Fluorescence Microspectroscopy.2004.I. C. Baianu, D. Costescu, N. E. Hofmann and S. S. Korban, 

q-bio/0407006 Quly 2004) (http://arxiv.org/abs/q-bio/0407006) 
[23] Raghavachari, R., Editor. 2001. Near-Infrared Applications in Biotechnology, Marcel-Dekker, New York, NY. 
[24] http://www.imaging.net/chemical-imaging/Chemical imaging 
[25] http://www.malvern.com/LabEng/products/sdi/bibliography/sdi_bibliography.htm E. N. Lewis, E. Lee 

and L. H. Kidder, Combining Imaging and Spectroscopy: Solving Problems with Near-Infrared Chemical 

Imaging. Microscopy Today, Volume 12, No. 6, 11/2004. 
[26] D.S. Mantus and G. H. Morrison. 1991. Chemical imaging in biology and medicine using ion microscopy., 

Microchimica Acta, 104, (1-6) January 1991, doi: 10.1007/BF01245536 
[27] Near Infrared Microspectroscopy, Fluorescence Microspectroscopy, Infrared Chemical Imaging and High 

Resolution Nuclear Magnetic Resonance Analysis of Soybean Seeds, Somatic Embryos and Single Cells., 

Baianu, I.C. et al. 2004., In Oil Extraction and Analysis., D. Luthria, Editor pp. 241-273, AOCS Press., 

Champaign, IL. 
[28] Single Cancer Cell Detection by Near Infrared Microspectroscopy, Infrared Chemical Imaging and 

Fluorescence Microspectroscopy.2004.I. C. Baianu, D. Costescu, N. E. Hofmann and S. S. Korban, 

q-bio/0407006 (July 2004) (http://arxiv.org/abs/q-bio/0407006) 
[29] J. Dubois, G. Sando, E. N. Lewis, Near-Infrared Chemical Imaging, A Valuable Tool for the Pharmaceutical 

Industry, G.I.T. Laboratory Journal Europe, No. 1-2, 2007. 
[30] Applications of Novel Techniques to Health Foods, Medical and Agricultural Biotechnology. Qune 2004)., I. C. 

Baianu, P. R. Lozano, V. I. Prisecaru and H. C. Lin q-bio/0406047 (http://arxiv.org/abs/q-bio/0406047) 
[31] Chemical Imaging Without Dyeing (http://witec.de/en/download/Raman/ImagingMicroscopy04.pdf) 
[32] C.L. Evans and X.S. Xie.2008. Coherent Anti-Stokes Raman Scattering Microscopy: Chemical Imaging for 

Biology and Medicine., doi: 10.1 146/annurev.anchem. 1.031207. 112754 Annual Review of Analytical Chemistry, 

1: 883-909. 
[33] Eigen, M., Rigler, M. Sorting single molecules: application to diagnostics and evolutionary 

biotechnology, (1994) Proc. Natl. Acad. Sci. USA, 91,5740-5747. 
[34] Rigler, M. Fluorescence correlations, single molecule detection and large number screening. Applications in 

biotechnology,(1995) J. Biotechnol., 41,177-186. 
[35] Rigler R. and Widengren J. (1990). Ultrasensitive detection of single molecules by fluorescence correlation 

spectroscopy, BioScience (Ed. Klinge & Owman) p. 180. 
[36] Single Cancer Cell Detection by Near Infrared Microspectroscopy, Infrared Chemical Imaging and 

Fluorescence Microspectroscopy.2004.I. C. Baianu, D. Costescu, N. E. Hofmann, S. S. Korban and et al., 

q-bio/0407006 Quly 2004) (http://arxiv.org/abs/q-bio/0407006) 
[37] Oehlenschlager F., Schwille P. and Eigen M. (1996). Detection of HIV-1 RNA by nucleic acid sequence-based 

amplification combined with fluorescence correlation spectroscopy, Proc. Natl. Acad. Sci. USA 93:1281. 
[38] Bagatolli, L.A., and Gratton, E. (2000). Two-photon fluorescence microscopy of coexisting lipid domains in 

giant unilamellar vesicles of binary phospholipid mixtures. Biophys J., 78:290-305. 
[39] Schwille, P., Haupts, U., Maiti, S., and Webb. W.(1999). Molecular dynamics in living cells observed by 

fluorescence correlation spectroscopy with one- and two-photon excitation. Biophysical Journal, 

77(10):2251-2265. 
[40] Near Infrared Microspectroscopy, Fluorescence Microspectroscopy, Infrared Chemical Imaging and High 

Resolution Nuclear Magnetic Resonance Analysis of Soybean Seeds, Somatic Embryos and Single Cells., 

Baianu, I.C. et al. 2004., In Oil Extraction and Analysis., D. Luthria, Editor pp. 241-273, AOCS Press., 

Champaign, IL. 
[41] FRET description (http://dwb.unl.edu/Teacher/NSF/C08/C08Links/pps99.cryst.bbk.ac.uk/projects/ 

gmocz/fret. htm) 
[42] doi:10.1016/S0959-440X(00)00190-l (http://dx.doi.org/10. 1016/S0959-440X(00)00190-l)Recent 

advances in FRET: distance determination in protein-DNA complexes. Current Opinion in Structural Biology 

2001, 11(2), 201-207 
[43] http://www.fretimaging.org/mcnamaraintro.html FRET imaging introduction 



Molecular models of DNA 158 

[44] Eigen, M., and Rigler, R. (1994). Sorting single molecules: Applications to diagnostics and evolutionary 

biotechnology, Proc. Natl. Acad. Sci. USA 91:5740. 
[45] Hallin PF, David Ussery D (2004). "CBS Genome Atlas Database: A dynamic storage for bioinformatic results 

and DNA sequence data". Bioinformatics 20: 3682-3686. 
[46] Zhang CT, Zhang R, Ou HY (2003). "The Z curve database: a graphic representation of genome sequences". 

Bioinformatics 19 (5): 593-599. doi:10.1093/bioinformatics/btg041 

References 

Applications of Novel Techniques to Health Foods, Medical and Agricultural 

Biotechnology.Qune 2004) I. C. Baianu, P. R. Lozano, V. I. Prisecaru and H. C. Lin., 

q-bio/0406047. 

F. Bessel, Untersuchung des Theils der planetarischen Storungen, Berlin Abhandlungen 

(1824), article 14. 

Sir Lawrence Bragg, FRS. The Crystalline State, A General survey. London: G. Bells and 

Sons, Ltd., vols. 1 and 2., 1966., 2024 pages. 

Cantor, C. R. and Schimmel, P.R. Biophysical Chemistry, Parts I and II. , San Franscisco: 

W.H. Freeman and Co. 1980. 1,800 pages. 

Eigen, M., and Rigler, R. (1994). Sorting single molecules: Applications to diagnostics 

and evolutionary biotechnology, Proc. Natl. Acad. Sci. USA 91:5740. 

Raghavachari, R., Editor. 2001. Near-Infrared Applications in Biotechnology, 

Marcel-Dekker, New York, NY. 

Rigler R. and Widengren J. (1990). Ultrasensitive detection of single molecules by 

fluorescence correlation spectroscopy, BioScience (Ed. Klinge & Owman) p. 180. 

Single Cancer Cell Detection by Near Infrared Microspectroscopy, Infrared Chemical 

Imaging and Fluorescence Microspectroscopy.2004. I. C. Baianu, D. Costescu, N. E. 

Hofmann, S. S. Korban and et al., q-bio/0407006 (July 2004). 

Voet, D. and J.G. Voet. Biochemistry, 2nd Edn., New York, Toronto, Singapore: John Wiley 

& Sons, Inc., 1995, ISBN 0-471-58651-X., 1361 pages. 

Watson, G. N. A Treatise on the Theory of Bessel Functions., (1995) Cambridge 

University Press. ISBN 0-521-48391-3. 

Watson, James D. and Francis H.C. Crick. A structure for Deoxyribose Nucleic Acid 

(http://www.nature.com/nature/dna50/watsoncrick.pdf) (PDF). Nature 111, 737-738, 

25 April 1953. 

Watson, James D. Molecular Biology of the Gene. New York and Amsterdam: W.A. 

Benjamin, Inc. 1965., 494 pages. 

Wentworth, W.E. Physical Chemistry. A short course., Maiden (Mass.): Blackwell Science, 

Inc. 2000. 

Herbert R. Wilson, FRS. Diffraction of X-rays by proteins, Nucleic Acids and Viruses., 

London: Edward Arnold (Publishers) Ltd. 1966. 

Kurt Wuthrich. NMR of Proteins and Nucleic Acids., New York, Brisbane,Chicester, 

Toronto, Singapore: J. Wiley & Sons. 1986., 292 pages. 

Robinson, Bruche H.; Seeman, Nadrian C. (August 1987). "The Design of a Biochip: A 

Self-Assembling Molecular-Scale Memory Device". Protein Engineering 1 (4): 295-300. 

ISSN 0269-2139 (http://worldcat.org/issn/0269-2139). Link (http://peds. 

oxf ordj ournals . org/cgi/content/abstract/ 1/4/295) 

Rothemund, Paul W. K.; Ekani-Nkodo, Axel; Papadakis, Nick; Kumar, Ashish; Fygenson, 

Deborah Kuchnir & Winfree, Erik (22 December 2004). "Design and Characterization of 

Programmable DNA Nanotubes". Journal of the American Chemical Society 126 (50): 



Molecular models of DNA 1 59 

16344-16352. doi: 10.1021/ja0443191 (http://dx.doi.org/10.1021/ja0443191). ISSN 
0002-7863 (http://worldcat.org/issn/0002-7863). 

• Keren, K.; Kinneret Keren, Rotem S. Berman, Evgeny Buchstab, Uri Sivan, Erez Braun 
(November 2003). 

"http://www.sciencemag.org/cgi/content/abstract/sci;302/5649/1380|DNA-Templated 
Carbon Nanotube Field-Effect Transistor". Science 302 (6549): 1380-1382. doi: 

10.1 126/science. 1091022 (http://dx.doi.org/10.1126/science.1091022). ISSN 
1095-9203 (http://worldcat.org/issn/1095-9203). http://www.sciencemag.org/cgi/ 
content/abstract/sci;302/5649/1380. 

• Zheng, Jiwen; Constantinou, Pamela E.; Micheel, Christine; Alivisatos, A. Paul; Kiehl, 
Richard A. & Seeman Nadrian C. (2006). "2D Nanoparticle Arrays Show the 
Organizational Power of Robust DNA Motifs". Nano Letters 6: 1502-1504. doi: 
10.1021/nl060994c (http://dx.doi.org/10.1021/nl060994c). ISSN 1530-6984 (http:// 
worldcat.org/issn/1530-6984). 

• Cohen, Justin D.; Sadowski, John P.; Dervan, Peter B. (2007). "Addressing Single 
Molecules on DNA Nanostructures". Angewandte Chemie 46 (42): 7956-7959. doi: 
10.1002/anie.200702767 (http://dx.doi.org/10.1002/anie.200702767). ISSN 
0570-0833 (http://worldcat.org/issn/0570-0833). 

• Mao, Chengde; Sun, Weiqiong & Seeman, Nadrian C. (16 June 1999). "Designed 
Two-Dimensional DNA Holliday Junction Arrays Visualized by Atomic Force Microscopy". 
Journal of the American Chemical Society 121 (23): 5437-5443. doi: 10.1021/ja9900398 
(http://dx.doi.org/10.1021/ja9900398). ISSN 0002-7863 (http://worldcat.org/issn/ 
0002-7863). 

• Constantinou, Pamela E.; Wang, Tong; Kopatsch, Jens; Israel, Lisa B.; Zhang, Xiaoping; 
Ding, Baoquan; Sherman, William B.; Wang, Xing; Zheng, Jianping; Sha, Ruojie & 
Seeman, Nadrian C. (2006). "Double cohesion in structural DNA nanotechnology". 
Organic and Biomolecular Chemistry 4: 3414-3419. doi: 10.1039/b605212f (http://dx. 
doi.org/10.1039/b605212f). 

See also 

DNA 

Molecular graphics 

DNA structure 

DNA Dynamics 

X-ray scattering 

Neutron scattering 

Crystallography 

Crystal lattices 

Paracrystalline lattices/Paracrystals 

2D-FT NMRI and Spectroscopy 

NMR Spectroscopy 

Microwave spectroscopy 

Two-dimensional IR spectroscopy 

Spectral imaging 

Hyperspectral imaging 

Chemical imaging 

NMR microscopy 



Molecular models of DNA 160 

VCD or Vibrational circular dichroism 

FRET and FCS- Fluorescence correlation spectroscopy 

Fluorescence cross-correlation spectroscopy (FCCS) 

Molecular structure 

Molecular geometry 

Molecular topology 

DNA topology 

Sirius visualization software 

Nanostructure 

DNA nanotechnology 

Imaging 

Atomic force microscopy 

X-ray microscopy 

Liquid crystal 

Glasses 

QMC@Home 

Sir Lawrence Bragg, FRS 

Sir John Randall 

James Watson 

Francis Crick 

Maurice Wilkins 

Herbert Wilson, FRS 

Alex Stokes 

External links 

DNA the Double Helix Game (http://nobelprize.org/educational_games/medicine/ 

dnadoublehelix/) From the official Nobel Prize web site 

MDDNA: Structural Bioinformatics of DNA (http://humphry.chem. wesleyan.edu:8080/ 

MDDNA/) 

Double Helix 1953-2003 (http://www.ncbe.reading.ac.uk/DNA50/) National Centre 

for Biotechnology Education 

DNA under electron microscope (http://www.fidelitysystems.com/Unlinked_DNA. 

html) 

Ascalaph DNA (http://www.agilemolecule.com/Ascalaph/Ascalaph_DNA.html) — 

Commercial software for DNA modeling 

DNAlive: a web interface to compute DNA physical properties (http://mmb.pcb.ub.es/ 

DNAlive). Also allows cross-linking of the results with the UCSC Genome browser and 

DNA dynamics. 

DiProDB: Dinucleotide Property Database (http://diprodb.fli-leibniz.de). The database 

is designed to collect and analyse thermodynamic, structural and other dinucleotide 

properties. 

Further details of mathematical and molecular analysis of DNA structure based on X-ray 

data (http ://planetphysics . org/encyclopedia/ 

BesselFunctionsApplicationsToDiffractionByHelicalStructures.html) 

Bessel functions corresponding to Fourier transforms of atomic or molecular helices. 

(http://planetphysics.org/?op=getobj&from=objects& 

name=BesselFunctionsAndTheirApplicationsToDiffractionByHelicalStructures) 
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• Application of X-ray microscopy in analysis of living hydrated cells (http://www.ncbi. 
nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract& 
list_uids=12379938) 

• Characterization in nanotechnology some pdfs (http://nanocharacterization.sitesled. 
com/) 

• overview of STM/AFM/SNOM principles with educative videos (http://www.ntmdt.ru/ 
SPM-Techniques/Principles/) 

• SPM Image Gallery - AFM STM SEM MFM NSOM and More (http://www.rhk-tech.com/ 
results/showcase. php) 

• How SPM Works (http://www.parkafm.com/New_html/resources/01general.php) 

• U.S. National DNA Day (http://www.genome.gov/10506367) — watch videos and 
participate in real-time discussions with scientists. 

• The Secret Life of DNA - DNA Music compositions (http://www.tjmitchell.com/stuart/ 
dna.html) 



Molecular graphics 



Molecular graphics (MG) is the discipline and philosophy of studying molecules and their 
properties through graphical representation. J IUPAC limits the definition to 
representations on a "graphical display device". ] Ever since Dalton's atoms and Kekule's 
benzene, there has been a rich history of hand-drawn atoms and molecules, and these 
representations have had an important influence on modern molecular graphics. This 
article concentrates on the use of computers to create molecular graphics. Note, however, 
that many molecular graphics programs and systems have close coupling between the 
graphics and editing commands or calculations such as in molecular modelling. 



Relation to molecular models 

There has been a long tradition of creating 
molecular models from physical materials. 
Perhaps the best known is Crick and 
Watson's model of DNA built from rods and 
planar sheets, but the most widely used 
approach is to represent all atoms and 
bonds explicitly using the "ball and stick" 
approach. This can demonstrate a wide 
range of properties, such as shape, relative 
size, and flexibility. Many chemistry 
courses expect that students will have 
access to ball and stick models. One goal of 
mainstream molecular graphics has been to 
represent the "ball and stick" model as 
realistically as possible and to couple this 
with calculations of molecular properties. 

Figure 1 shows a small molecule (NH 3 CH 2 CH 2 C(OH)(P0 3 H)(P0 3 H)-), as drawn by the Jmol 
program. It is important to realise that the colours are purely a convention. Molecules can 




Fig. 1. Key: Hydrogen = white, carbon = grey, 

nitrogen = blue, oxygen = red, and phosphorus = 

orange. 
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never be visible under any light microscope and atoms are not coloured, do not have hard 
surfaces and do not reflect light. Bonds are not rod-shaped. If physical molecular models 
had not existed, it is unlikely that molecular graphics would currently use this metaphor. 

Comparison of physical models with molecular graphics 

Physical models and computer models have partially complementary strengths and 
weaknesses. Physical models can be used by those without access to a computer and now 
can be made cheaply out of plastic materials. Their tactile and visual aspects cannot be 
easily reproduced by computers (although haptic devices have occasionally been built). On 
a computer screen, the flexibility of molecules is also difficult to appreciate; illustrating the 
pseudorotation of cyclohexane is a good example of the value of mechanical models. 

However, it is difficult to build large physical molecules, and all-atom physical models of 
even simple proteins could take weeks or months to build. Moreover, physical models are 
not robust and they decay over time. Molecular graphics is particularly valuable for 
representing global and local properties of molecules, such as electrostatic potential. 
Graphics can also be animated to represent molecular processes and chemical reactions, a 
feat that is not easy to reproduce physically. 

History 

Initially the rendering was on early CRT screens or through plotters drawing on paper. 
Molecular structures have always been an attractive choice for developing new computer 
graphics tools, since the input data are easy to create and the results are usually highly 
appealing. The first example of MG was a display of a protein molecule (Project MAC, 1966) 
by Cyrus Levinthal and Robert Langridge. Among the milestones in high-performance MG 
was the work of Nelson Max in "realistic" rendering of macromolecules using reflecting 
spheres. 

By about 1980 many laboratories both in academia and industry had recognized the power 
of the computer to analyse and predict the properties of molecules, especially in materials 
science and the pharmaceutical industry. The discipline was often called "molecular 
graphics" and in 1982 a group of academics and industrialists in the UK set up the 
Molecular Graphics Society (MGS). Initially much of the technology concentrated either on 
high-performance 3D graphics, including interactive rotation or 3D rendering of atoms as 
spheres (sometimes with radiosity). During the 1980s a number of programs for calculating 
molecular properties (such as molecular dynamics and quantum mechanics) became 
available and the term "molecular graphics" often included these. As a result the MGS has 
now changed its name to the Molecular Graphics and Modelling Society (MGMS). 

The requirements of macromolecular crystallography also drove MG because the traditional 
techniques of physical model-building could not scale. Alwyn Jones' FRODO program (and 
later "O") were developed to overlay the molecular electron density determined from X-ray 
crystallography and the hypothetical molecular structure. 
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Art, science and technology in molecular graphics 

Both computer technology and graphic arts have 
contributed to molecular graphics. The development 
of structural biology in the 1950s led to a 
requirement to display molecules with thousands of 
atoms. The existing computer technology was 
limited in power, and in any case a naive depiction 
of all atoms left viewers overwhelmed. Most systems 
therefore used conventions where information was 
implicit or stylistic. Two vectors meeting at a point 
implied an atom or (in macromolecules) a complete 
residue (10-20 atoms). 

The macromolecular approach was popularized by 
Dickerson and Gels' presentation of proteins and the 
graphic work of Jane Richardson through 
high-quality hand-drawn diagrams such as the 
"ribbon" representation. In this they strove to 
capture the intrinsic 'meaning' of the molecule. This 
search for the "messages in the molecule" has 
always accompanied the increasing power of 
computer graphics processing. Typically the 
depiction would concentrate on specific areas of the 
molecule (such as the active site) and this might 
have different colours or more detail in the number 
of explicit atoms or the type of depiction (e.g., 
spheres for atoms). 




Fig. 2. Image of hemagglutinin with alpha 

helices depicted as cylinders and the rest 

of the chain as silver coils. The individual 

protein atoms (several thousand) have 

been hidden. All of the non-hydrogen atoms 

in the two ligands (presumably sialic acid) 

have been shown near the top of the 

diagram. Key: Carbon = grey, oxygen = 

red, nitrogen = blue. 



In some cases the limitations of technology have led 

to serendipitous methods for rendering. Most early graphics devices used vector graphics, 
which meant that rendering spheres and surfaces was impossible. Michael Connolly's 
program "MS" calculated points on the surface-accessible surface of a molecule, and the 
points were rendered as dots with good visibility using the new vector graphics technology, 
such as the Evans and Sutherland PS300 series. Thin sections ("slabs") through the 
structural display showed very clearly the complementarity of the surfaces for molecules 
binding to active sites, and the "Connolly surface" became a universal metaphor. 

The relationship between the art and science of molecular graphics is shown in the 
exhibitions L J sponsored by the Molecular Graphics Society. Some exhibits are created with 
molecular graphics programs alone, while others are collages, or involve physical materials. 
An example from Mike Hann (1994), inspired by Magritte's painting Ceci n l est pas une 
pipe, uses an image of a salmeterol molecule. 

"Ceci n l est pas une molecule/ 1 writes Mike Hann, "serves to remind us that all of the 
graphics images presented here are not molecules, not even pictures of molecules, but 
pictures of icons which we believe represent some aspects of the molecule's properties." 
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Space-filling models 

Fig. 4 is a "space-filling" representation of formic acid, 

where atoms are drawn to suggest the amount of space 

they occupy. This is necessarily an icon: in the quantum 

mechanical representation of molecules, there are only 

(positively charged) nuclei and a "cloud" of negative 

electrons. The electron cloud defines an approximate 

size for the molecule, though there can be no single 

precise definition of size. For many years the size of 

atoms has been approximated by mechanical models 

(CPK), where the atoms have been represented by 

plastic spheres whose radius (van der Waals radius) 

describes a sphere within which "most" of the electron 

density can be found. These spheres could be clicked 

together to show the steric aspects of the molecule 

rather than the positions of the nuclei. Fig. 4 shows the 

intricacy required to make sure that all spheres intersect correctly, and also demonstrates 

a reflective model. 




Fig. 4. Space-filling model of formic 

acid. Key: Hydrogen = white, carbon = 

black, oxygen = red. 




Fig. 5. A molecule (zirconocene) where 

part (left) is rendered as ball-and-stick 

and part (right) as an isosurface. 



Since the atomic radii (e.g. in Fig. 4) are only slightly 
less than the distance between bonded atoms, the 
iconic spheres intersect, and in the CPK models, this 
was achieved by planar truncations along the bonding 
directions, the section being circular. When raster 
graphics became affordable, one of the common 
approaches was to replicate CPK models in silico. It is 
relatively straightforward to calculate the circles of 
intersection, but more complex to represent a model 
with hidden surface removal. A useful side product is 
that a conventional value for the molecular volume can 
be calculated. 



The use of spheres is often for convenience, being 
limited both by graphics libraries and the additional effort required to compute complete 
electronic density or other space-filling quantities. It is now relatively common to see 
images of isosurfaces that have been coloured to show quantities such as electrostatic 
potential. The commonest isosurfaces are the Connolly surface, or the volume within which 
a given proportion of the electron density lies. The isosurface in Fig. 5 appears to show the 
electrostatic potential, with blue colours being negative and red/yellow (near the metal) 
positive. (There is no absolute convention of colouring, and red/positive, blue/negative are 
often confusingly reversed!) Opaque isosurfaces do not allow the atoms to be seen and 
identified and it is not easy to deduce them. Because of this, isosurfaces are often drawn 
with a degree of transparency. 
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Technology 

Molecular graphics has always pushed the limits of display technology, and has seen a 
number of cycles of integration and separation of compute-host and display. Early systems 
like Project MAC were bespoke and unique, but in the 1970s the MMS-X and similar 
systems used (relatively) low-cost terminals, such as the Tektronix 4014 series, often over 
dial-up lines to multi-user hosts. The devices could only display static pictures but, were 
able to evangelize MG. In the late 1970s, it was possible for departments (such as 
crystallography) to afford their own hosts (e.g., PDP-11) and to attach a display (such as 
Evans & Sutherland's MPS) directly to the bus. The display list was kept on the host, and 
interactivity was good since updates were rapidly reflected in the display— at the cost of 
reducing most machines to a single-user system. 

In the early 1980s, Evans & Sutherland (E&S) decoupled their PS300 display, which 
contained its own display information transformable through a dataflow architecture. 
Complex graphical objects could be downloaded over a serial line (e.g. 9600 baud) and then 
manipulated without impact on the host. The architecture was excellent for high 
performance display but very inconvenient for domain-specific calculations, such as 
electron-density fitting and energy calculations. Many crystallographers and modellers 
spent arduous months trying to fit such activities into this architecture. 

The benefits for MG were considerable, but by the later 1980s, UNIX workstations such as 
Sun-3 with raster graphics (initially at a resolution of 256 by 256) had started to appear. 
Computer-assisted drug design in particular required raster graphics for the display of 
computed properties such as atomic charge and electrostatic potential. Although E&S had a 
high-end range of raster graphics (primarily aimed at the aerospace industry) they failed to 
respond to the low-end market challenge where single users, rather than engineering 
departments, bought workstations. As a result the market for MG displays passed to Silicon 
Graphics, coupled with the development of minisupercomputers (e.g., CONVEX and Alliant) 
which were affordable for well-supported MG laboratories. Silicon Graphics provided a 
graphics language, IrisGL, which was easier to use and more productive than the PS300 
architecture. Commercial companies (e.g., Biosym, Polygen/MSI) ported their code to 
Silicon Graphics, and by the early 1990s, this was the "industry standard". 

Stereoscopic displays were developed based on liquid crystal polarized spectacles, and 
while this had been very expensive on the PS300, it now became a commodity item. A 
common alternative was to add a polarizable screen to the front of the display and to 
provide viewers with extremely cheap spectacles with orthogonal polarization for separate 
eyes. With projectors such as Barco, it was possible to project stereoscopic display onto 
special silvered screens and supply an audience of hundreds with spectacles. In this way 
molecular graphics became universally known within large sectors of chemical and 
biochemical science, especially in the pharmaceutical industry. Because the backgrounds of 
many displays were black by default, it was common for modelling sessions and lectures to 
be held with almost all lighting turned off. 

In the last decade almost all of this technology has become commoditized. IrisGL evolved to 
OpenGL so that molecular graphics can be run on any machine. In 1992, Roger Sayle 
released his RasMol program into the public domain. RasMol contained a very 
high-performance molecular renderer that ran on Unix/X Window, and Sayle later ported 
this to the Windows and Macintosh platforms. The Richardsons developed kinemages and 
the Mage software, which was also multi-platform. By specifying the chemical MIME type, 
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molecular models could be served over the Internet, so that for the first time MG could be 
distributed at zero cost regardless of platform. In 1995, Birkbeck College's crystallography 
department used this to run "Principles of Protein Structure", the first multimedia course 
on the Internet, which reached 100 to 200 scientists. 





Fig. 6. A molecule of Porin (protein) shown without ambient occlusion (left) and with (right). Advanced rendering 
effects can improve the comprehension of the 3D shape of a molecule. 



MG continues to see innovation that balances technology and art, and currently zero-cost or 
open source programs such as PyMOL and Jmol have very wide use and acceptance. 

Recently the wide spread diffusion of advanced graphics hardware, has improved the 
rendering capabilities of the visualization tools. The capabilities of current shading 
languages allow the inclusion of advanced graphic effects (like ambient occlusion, cast 
shadows and non-photorealistic rendering techniques) in the interactive visualization of 
molecules. These graphic effects, beside being eye candy, can improve the comprehension 
of the three dimensional shapes of the molecules. An example of the effects that can be 
achieved exploiting recent graphics hardware can be seen in the simple open source 
visualization system QuteMol. 



Algorithms 

Reference frames 

Drawing molecules requires a transformation between molecular coordinates (usually, but 
not always, in Angstrom units) and the screen. Because many molecules are chiral it is 
essential that the handedness of the system (almost always right-handed) is preserved. In 
molecular graphics the origin (0, 0) is usually at the lower left, while in many computer 
systems the origin is at top left. If the z-coordinate is out of the screen (towards the viewer) 
the molecule will be referred to right-handed axes, while the screen display will be 
left-handed. 

Molecular transformations normally require: 

• scaling of the display (but not the molecule). 

• translations of the molecule and objects on the screen. 

• rotations about points and lines. 

Conformational changes (e.g. rotations about bonds) require rotation of one part of the 
molecule relative to another. The programmer must decide whether a transformation on the 
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screen reflects a change of view or a change in the molecule or its reference frame. 

Simple 

In early displays only vectors could be drawn e.g. (Fig. 
7) which are easy to draw because no rendering or 
hidden surface removal is required. 

On vector machines the lines would be smooth but on 
raster devices Bresenham's algorithm is used (note the 
"jaggies" on some of the bonds, which can be largely 
removed with antialiasing software.) 

Atoms can be drawn as circles, but these should be 
sorted so that those with the largest z-coordinates 
(nearest the screen) are drawn last. Although 
imperfect, this often gives a reasonably attractive 

Fig. 7. Stick model of caffeine drawn in 

Tmol display. Other simple tricks which do not include 

hidden surface algorithms are: 

• colouring each end of a bond with the same colour as the atom to which it is attached 
(Fig. 7). 

• drawing less than the whole length of the bond (e.g. 10%-90%) to simulate the bond 
sticking out of a circle. 

• adding a small offset white circle within the circle for an atom to simulate reflection. 

Typical pseudocode for creating Fig. 7 (to fit the molecule exactly to the screen): 

// assume: 

// atoms with x, y, z coordinates (Angstrom) and elementSymbol 
// bonds with pointers/references to atoms at ends 
// table of colours for elementTypes 

// find limits of molecule in molecule coordinates as xMin, yMin, xMax, 
yMax 
scale = min(xScreenMax/(xMax-xMin) , yScreenMax/(yMax-yMin) ) 
xOffset = -xMin * scale; yOffset = -yMin * scale 
for (bond in $bonds) { 

atomO = bond.getAtom(O) 

atoml = bond.getAtom(l) 

x0 = xOffset+atom0.getX()*scale; y0 = yOffset+atom0.getY( )*scale // 
(1) 

xl = xOf f set+atoml.getX( )*scale; yl = yOff set+atoml.getY( )*scale // 
(2) 

xl = atoml. getX() ; yl = atoml. getY() 

xMid = (x0 + xl) /2; yMid = (y0 + yl) /2; 

colourO = ColourTable.getColour(atom0.getSymbol() ) 

drawLine (colourO, x0, y0, xMid, yMid) 

colourl = ColourTable.getColour(atoml.getSymbol() ) 

drawLine (colourl, xl, yl, xMid, yMid) 
} 
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Note that this assumes the origin is in the bottom left corner of the screen, with Y up the 
screen. Many graphics systems have the origin at the top left, with Y down the screen. In 
this case the lines (1) and (2) should have the y coordinate generation as: 



y0 = yScreenMax 
yl = yScreenMax 



(yOffset+atom0.getY()*scale) // (1) 
(yOffset+atoml.getY()*scale) // (2) 



Changes of this sort change the handedness of the axes so it is easy to reverse the chirality 
of the displayed molecule unless care is taken. 

Advanced 

For greater realism and better comprehension of the 3D structure of a molecule many 
computer graphics algorithms can be used. For many years molecular graphics has 
stressed the capabilities of graphics hardware and has required hardware-specific 
approaches. With the increasing power of machines on the desktop, portability is more 
important and programs such as Jmol have advanced algorithms that do not rely on 
hardware. On the other hand recent graphics hardware is able to interactively render very 
complex molecule shapes with a quality that would not be possible with standard software 
techniques. 



Chronology 

This table provides an incomplete chronology of molecular graphics advances. 



Developer(s) 


Approximate 
date 


Technology 


Comments 


Crystallographers 


< 1960 


Hand-drawn 


Crystal structures, with hidden atom 
and bond removal. Often clinographic 
projections. 


Cyrus Levinthal, Bob 
Langridge 


1960s 


CRT 


First protein display on screen (Project 
MAC). 


Johnson, Motherwell 


ca 1970 


Pen plotter 


ORTEP, PLUTO. Very widely deployed 
for publishing crystal structures. 


Langridge, White, 
Marshall 


Late 1970s 


Departmental systems 
(PDP-11, Tektronix 
displays or DEC-VT11, e.g. 
MMS-X) 


Mixture of commodity computing with 
early displays. 


T. Alwyn Jones 


1978 


FRODO 


Crystallographic structure solution. 


Davies, Hubbard 


Mid-1980s 


CHEM-X, HYDRA 


Laboratory systems with multicolor, 
raster and vector devices (Sigmex, 
PS300). 


Biosym, Tripos, Polygen 


Mid-1980s 


PS300 and lower cost 
dumb terminals (VT200, 
SIGMEX) 


Commercial integrated modelling and 
display packages. 


Silicon Graphics, Sun 


Late 1980s 


IRIS GL (UNIX) 
workstations 


Commodity-priced single-user 
workstations with stereoscopic 
display. 


EMBL - WHAT IF [4] 


1989, 2000 


Machine independent 


Nearly free, multifunctional, still fully 
supported, many free servers 
based on it 
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Sayle, Richardson 


1992, 1993 


RasMol, Kinemage 


Platform-independent MG. 


MDL (van Vliet, Maffett, 
Adler, Holt) 


1995-1998 


Chime 


proprietary C + + ; free browser plugin 
for Mac (OS9) and PCs 


ChemAxon 


1998- 


MarvinSketch [6] & 

[7] 
Marvin View 

MarvinSpace [8] (2005) 


proprietary Java applet or stand-alone 
application. 


Community efforts 


2000- 


Jmol, PyMol, Protein 
Workshop (www.pdb.org) 


Open-source Java applet or 
stand-alone application. 


San Diego Supercomputer 
Center 


2006- 


Sirius 


Free for academic/non-profit 
institutions 


NOCH 


2002- 


NOC [9] 


Powerful and open source code 
molecular structure explorer 


Weizmann Institute of 
Science - Community 
efforts 


2008- 


Proteopedia 


Collaborative, 3D wiki encyclopedia of 
proteins & other molecules 
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See also 

• Molecular Design software 

• Molecular model 

• Molecular modelling 

• Molecular geometry 

• Software for molecular mechanics modeling 

External links 



• The PyMOL Molecular Graphics System (http://pymol.sf.net) -- open source 

• PyMOLWiki (http://pymolwiki.org) -- community supported wiki for PyMOL 

• History of Visualization of Biological Macromolecules (http://www.umass.edu/ 
microbio/rasmol/history.htm) by Eric Martz and Eric Francoeur. 

• Brief History of Molecular Mechanics/Graphics (http://stanley.chem.lsu.edu/webpub/ 
7770-Lecture-l-intro.pdf) in LSU CHEM7770 lecture notes. 

• Historical slides (http://luminary.stanford.edu/langridge/slides.htm) from Robert 
(Bob) Langridge. These show the influence of Crick and Watson on molecular graphics 
(including Levinthal's) and the development of early display technology, finishing with 
displays which were common in the mid-1980s on machines such as Evans and 
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Sutherland's PS300 series. 

Interview with Langridge. (http://luminary.stanford.edu/langridge/langridge.html) 

The display looking down the axis of B-DNA has been likened to a rose window. 

Nelson Max's home page (http://accad.osu.edu/~waynec/history/tree/max.html) 

with links to 1982 classics. 

Jmol home page (http://jmol.sourceforge.net/) contains an applet with an automatic 

display of many features of molecular graphics including metaphors, scripting, 

annotation and animation. 

Richardson Lab (http://kinemage.biochem.duke.edu/) includes Kinemage and 

molecular graphics images. 

History of RasMol. (http://www.openrasmol.org/history.html) 

Molecule of the Month (http://www.rcsb. org/pdb/static.do?p=education_discussion/ 

moleculeofthemonth/index.html) at RCSB/PDB. 

xeo (http://sourceforge.net/projects/xeo) xeo is a free (GPL) open project management 

for nanostructures using Java 

Exhibitions of Molecular Graphics Art (http://www.scripps.edu/mb/goodsell/mgs_art/ 

), 1994, 1998. 

NOCH home page (http://noch.sourceforge.net) A powerful, efficient and open source 

molecular graphics tool. 

eMovie (http://www.weizmann.ac.il/ISPC/eMovie.html): a tool for creation of 

molecular animations with PyMOL. 

Proteopedia (http://www.proteopedia.org): The collaborative, 3D encyclopedia of 

proteins and other molecules. 

Ascalaph Graphics (http://www.agilemolecule.com/Ascalaph/Ascalaph_Graphics. 

html): a molecular viewer with some geometry editing capabilities. 

Molecular Graphics and Modelling Society, (http://www.mgms.org/) 

Journal of Molecular Graphics and Modelling (http://www.sciencedirect.com/ 

science?_ob=JournalURL&_cdi=5260&_auth=y&_acct=C000053194&_version=l& 

_urlVersion=0&_userid=1495569&md5 = le86bcce088e98890cea52f6eda84b64) 

(formally Journal of Molecular Graphics). This journal is not open access. 
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DNA structure 

DNA structure shows a variety of forms, both double-stranded and single-stranded. The 
mechanical properties of DNA, which are directly related to its structure, are a significant 
problem for cells. Every process which binds or reads DNA is able to use or modify the 
mechanical properties of DNA for purposes of recognition, packaging and modification. The 
extreme length (a chromosome may contain a 10 cm long DNA strand), relative rigidity and 
helical structure of DNA has led to the evolution of histones and of enzymes such as 
topoisomerases and helicases to manage a cell's DNA. The properties of DNA are closely 
related to its molecular structure and sequence, particularly the weakness of the hydrogen 
bonds and electronic interactions that hold strands of DNA together compared to the 
strength of the bonds within each strand. 

Experimental techniques which can directly measure the mechanical properties of DNA are 
relatively new, and high-resolution visualization in solution is often difficult. Nevertheless, 
scientists have uncovered large amount of data on the mechanical properties of this 
polymer, and the implications of DNA's mechanical properties on cellular processes is a 
topic of active current research. 

It is important to note the DNA found in many cells can be macroscopic in length - a few 
centimetres long for each human chromosome. Consequently, cells must compact or 
"package" DNA to carry it within them. In eukaryotes this is carried by spool-like proteins 
known as histones, around which DNA winds. It is the further compaction of this 
DNA-protein complex which produces the well known mitotic eukaryotic chromosomes. 

Structure determination 

DNA structures can be determined using either nuclear magnetic resonance spectroscopy 
or X-ray crystallography. The first published reports of A-DNA X-ray diffraction patterns- 
and also B-DNA— employed analyses based on Patterson transforms that provided only a 
limited amount of structural information for oriented fibers of DNA isolated from calf 
thymus. ] [ ] An alternate analysis was then proposed by Wilkins et al. in 1953 for B-DNA 
X-ray diffraction/scattering patterns of hydrated, bacterial oriented DNA fibers and trout 
sperm heads in terms of squares of Bessel functions. ] Although the "B-DNA form' is most 
common under the conditions found in cells, J it is not a well-defined conformation but a 
family or fuzzy set of DNA-conformations that occur at the high hydration levels present in 
a wide variety of living cells. J Their corresponding X-ray diffraction & scattering patterns 
are characteristic of molecular paracrystals with a significant degree of disorder (>20%) [ ] 
L J , and concomitantly the structure is not tractable using only the standard analysis. 

On the other hand, the standard analysis, involving only Fourier transforms of Bessel 
functions^ ] and DNA molecular models, is still routinely employed for the analysis of 
A-DNA and Z-DNA X-ray diffraction patterns. [9] 
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Base pair geometry 

The geometry of a base, or base pair step can be characterized by 6 coordinates: Shift, 
Slide, Rise, Tilt, Roll, and Twist. These values precisely define the location and orientation 
in space of every base or base pair in a DNA molecule relative to its predecessor along the 
axis of the helix. Together, they characterize the helical structure of the molecule. In 
regions of DNA where the "normal" structure is disrupted the change in these values can be 
used to describe such disruption. 

For each base pair, considered relative to its predecessor J L J L J : 

Shear 

Stretch 

Stagger 

Buckle 

Propeller twist 

Rotation of one base with respect to the other in the same base pair. 

Opening 

Shift 

displacement along an axis in the base-pair plane perpendicular to the first, directed 
from the minor to the major groove. 

Slide 

displacement along an axis in the plane of the base pair directed from one strand to 
the other. 

Rise 

displacement along the helix axis. 
Tilt 

rotation around this axis. 
Roll 

rotation around this axis. 
Twist 

rotation around the helix axis. 
x-displacement 
y-displacement 
inclination 
tip 
pitch 

the number of base pairs per complete turn of the helix 

Rise and twist determine the handedness and pitch of the helix. The other coordinates, by 
contrast, can be zero. Slide and shift are typically small in B-DNA, but are substantial in A- 
and Z-DNA. Roll and tilt make successive base pairs less parallel, and are typically small. A 
diagram [ ] of these coordinates can be found in 3DNA [ ] website. 

Note that "tilt" has often been used differently in the scientific literature, referring to the 
deviation of the first, inter-strand base-pair axis from perpendicularity to the helix axis. This 
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corresponds to slide between a succession of base pairs, and in helix-based coordinates is 
properly termed "inclination". 



DNA helix geometries 

Three DNA conformations are believed to be found in nature, A-DNA, B-DNA, and Z-DNA. 
The "B" form described by James D. Watson and Francis Crick is believed to predominate in 
cells L . It is 23.7 A wide and extends 34 A per 10 bp of sequence. The double helix makes 
one complete turn about its axis every 10.4-10.5 base pairs in solution. This frequency of 
twist (known as the helical pitch) depends largely on stacking forces that each base exerts 
on its neighbours in the chain. 

Other conformations are possible; A-DNA, B-DNA, C-DNA, D-DNA 1 



[16] 



E-DNA 1 



[17] 



L-DNA(enantiomeric form of D-DNA) [16] , P-DNA [18] , S-DNA, Z-DNA, etc. have been 
described so far. ] In fact, only the letters F, Q, U, V, and Y are now available to describe 
any new DNA structure that may appear in the future. J L J However, most of these forms 
have been created synthetically and have not been observed in naturally occurring 
biological systems. Also note the triple-stranded DNA possibility. 



A- and Z-DNA 

A-DNA and Z-DNA differ significantly in their geometry and dimensions to B-DNA, although 
still form helical structures. The A form appears likely to occur only in dehydrated samples 
of DNA, such as those used in crystallographic experiments, and possibly in hybrid pairings 
of DNA and RNA strands. Segments of DNA that cells have methylated for regulatory 
purposes may adopt the Z geometry, in which the strands turn about the helical axis the 
opposite way to A-DNA and B-DNA. There is also evidence of protein-DNA complexes 
forming Z-DNA structures. 




Guanine 



Cytosine 



^^^Jl eft £ 5 / anti 

., n *Vh o C1 

syn , 





C5' 



axis of helix 
of Z-DNA 



3' 



The helix axis of A-, B-, and Z-DNA. 



Geometry attribute 


A-DNA 


B-DNA 


Z-DNA 


Helix sense 


right-handed 


right-handed 


left-handed 


Repeating unit 


1 bp 


1 bp 


2bp 


Rotation/bp 


33.6° 


35.9° 


6072bp 


Mean bp/turn 


10.7 


10.0 


12 


Inclination of bp to axis 


+ 19° 


-1.2° 


-9° 


Rise/bp along axis 


2.3 A 


3.32 A 


3.8 A 
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Pitch/turn of helix 


24.6 A 


33.2 A 


45.6 A 


Mean propeller twist 


+ 18° 


+ 16° 


0° 


Glycosyl angle 


anti 


anti 


C: anti, 
G: syn 


Sugar pucker 


C3'-endo 


C2'-endo 


C: C2'-endo / 
G: C2'-exo 


Diameter 


25.5 A 


23.7 A 


18.4 A 



Supercoiled DNA 

The B form of the DNA helix twists 360° per 10.4-10.5 bp in the absence of torsional strain. 
But many molecular biological processes can induce torsional strain. A DNA segment with 
excess or insufficient helical twisting is referred to, respectively, as positively or negatively 
"supercoiled". DNA in vivo is typically negatively supercoiled, which facilitates the 
unwinding (melting) of the double-helix required for RNA transcription. 

Non-helical forms 

Other non-double helical forms of DNA have been described, for example side-by-side (SBS) 
and triple helical configurations. Single stranded DNA may exist in statu nascendi or as 
thermally induced despiralized DNA. 

DNA bending 

DNA is a relatively rigid polymer, typically modelled as a worm-like chain. It has three 
significant degrees of freedom; bending, twisting and compression, each of which cause 
particular limitations on what is possible with DNA within a cell. Twisting/torsional stiffness 
is important for the circularisation of DNA and the orientation of DNA bound proteins 
relative to each other and bending/axial stiffness is important for DNA wrapping and 
circularisation and protein interactions. Compression/extension is relatively unimportant in 
the absence of high tension. 



Persistence length/Axial stiffness 

Example sequences and their persistence lengths (B DNA) 



Sequence 


Persistence Length 
/base pairs 


Random 


154±10 


^repeat 


133±10 


( CAG )repeat 


124±10 


(TATA) 

repeat 


137±10 



DNA in solution does not take a rigid structure but is continually changing conformation 
due to thermal vibration and collisions with water molecules, which makes classical 
measures of rigidity impossible. Hence, the bending stiffness of DNA is measured by the 
persistence length, defined as: 
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"The length of DNA over which the time-averaged orientation of the polymer becomes 
uncorrelated by a factor of e." 

This value may be directly measured using an atomic force microscope to directly image 
DNA molecules of various lengths. In aqueous solution the average persistence length is 
46-50 nm or 140-150 base pairs (the diameter of DNA is 2 nm), although can vary 
significantly. This makes DNA a moderately stiff molecule. 

The persistence length of a section of DNA is somewhat dependent on its sequence, and 
this can cause significant variation. The variation is largely due to base stacking energies 
and the residues which extend into the minor and major grooves. 

Models for DNA bending 

Stacking stability of base steps (B DNA) 



Step 


Stacking AG 
/kcal mol" 1 


TA 


-0.19 


T G or C A 


-0.55 


CG 


-0.91 


A G or C T 


-1.06 


A A or T T 


-1.11 


AT 


-1.34 


G A or T C 


-1.43 


C C or G G 


-1.44 


A C or G T 


-1.81 


G C 


-2.17 



The entropic flexibility of DNA is remarkably consistent with standard polymer physics 
models such as the Kratky-Porod worm-like chain model. Consistent with the worm-like 
chain model is the observation that bending DNA is also described by Hooke's law at very 
small (sub-piconewton) forces. However for DNA segments less than the persistence length, 
the bending force is approximately constant and behaviour deviates from the worm-like 
chain predictions. 

This effect results in unusual ease in circularising small DNA molecules and a higher 
probability of finding highly bent sections of DNA. 



Bending preference 

DNA molecules often have a preferred direction to bend, ie. anisotropic bending. This is, 
again, due to the properties of the bases which make up the DNA sequence - a random 
sequence will have no preferred bend direction, i.e. isotropic bending. 

Preferred DNA bend direction is determined by the stability of stacking each base on top of 
the next. If unstable base stacking steps are always found on one side of the DNA helix then 
the DNA will preferentially bend away from that direction. As bend angle increases then 
steric hindrances and ability to roll the residues relative to each other also play a role, 
especially in the minor groove. A and T residues will be preferentially be found in the minor 
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grooves on the inside of bends. This effect is particularly seen in DNA-protein binding 
where tight DNA bending is induced, such as in nucleosome particles. See base step 
distortions above. 

DNA molecules with exceptional bending preference can become intrinsically bent. This 
was first observed in trypanosomatid kinetoplast DNA. Typical sequences which cause this 
contain stretches of 4-6 T and A residues separated by G and C rich sections which keep 
the A and T residues in phase with the minor groove on one side of the molecule. For 
example: 

I I I I 

I I 

GATTCCCAAAAATGTCAAAAAATAGGCAAAAAATGC 
CAAAAAATCCCAAAC 

The intrinsically bent structure is induced by the 'propeller twist 1 of base pairs relative to 
each other allowing unusual bifurcated Hydrogen-bonds between base steps. At higher 
temperatures this structure, and so the intrinsic bend, is lost. 

All DNA which bends anisotropically has, on average, a longer persistence length and 
greater axial stiffness. This increased rigidity is required to prevent random bending which 
would make the molecule act isotropically. 

DNA circularisation 

DNA circularisation depends on both the axial (bending) stiffness and torsional (rotational) 
stiffness of the molecule. For a DNA molecule to successfully circularise it must be long 
enough to easily bend into the full circle and must have the correct number of bases so the 
ends are in the correct rotation to allow bonding to occur. The optimum length for 
circularisation of DNA is around 400 base pairs (136 nm), with an integral number of turns 
of the DNA helix, i.e. multiples of 10.4 base pairs. Having a non integral number of turns 
presents a significant energy barrier for circularisation, for example a 10.4 x 30 = 312 base 
pair molecule will circularise hundreds of times faster than 10.4 x 30.5 ~ 317 base pair 
molecule. 

DNA stretching 

Longer stretches of DNA are entropically elastic under tension. When DNA is in solution, it 
undergoes continuous structural variations due to the energy available in the solvent. This 
is due to the thermal vibration of the molecule combined with continual collisions with 
water molecules. For entropic reasons, more compact relaxed states are thermally 
accessible than stretched out states, and so DNA molecules are almost universally found in 
a tangled relaxed layouts. For this reason, a single molecule of DNA will stretch under a 
force, straightening it out. Using optical tweezers, the entropic stretching behavior of DNA 
has been studied and analyzed from a polymer physics perspective, and it has been found 
that DNA behaves largely like the Kratky-Porod worm-like chain model under 
physiologically accessible energy scales. 

Under sufficient tension and positive torque, DNA is thought to undergo a phase transition 
with the bases splaying outwards and the phosphates moving to the middle. This proposed 
structure for overstretched DNA has been called "P-form DNA," in honor of Linus Pauling 
who originally presented it as a possible structure of DNA [18] 
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The mechanical properties DNA under compression have not been characterized due to 
experimental difficulties in preventing the polymer from bending under the compressive 
force. 

DNA melting 

Melting stability of base steps (B DNA) 



Step 


Melting AG 
/Kcal mol' 1 


TA 


-0.12 


T G or C A 


-0.78 


CG 


-1.44 


A G or C T 


-1.29 


A A or T T 


-1.04 


AT 


-1.27 


G A or T C 


-1.66 


C C or G G 


-1.97 


A C or G T 


-2.04 


G C 


-2.70 



DNA melting is the process by which the interactions between the strands of the double 
helix are broken, separating the two strands of DNA. These bonds are weak, easily 
separated by gentle heating, enzymes, or physical force. DNA melting preferentially occurs 
at certain points in the DNA. J T and A rich sequences are more easily melted than C and 
G rich regions. Particular base steps are also susceptible to DNA melting, particularly T A 
and T G base steps. ] These mechanical features are reflected by the use of sequences 
such as TATAA at the start of many genes to assist RNA polymerase in melting the DNA for 
transcription. 

Strand separation by gentle heating, as used in PCR, is simple providing the molecules have 
fewer than about 10,000 base pairs (10 kilobase pairs, or 10 kbp). The intertwining of the 
DNA strands makes long segments difficult to separate. The cell avoids this problem by 
allowing its DNA-melting enzymes (helicases) to work concurrently with topoisomerases, 
which can chemically cleave the phosphate backbone of one of the strands so that it can 
swivel around the other. Helicases unwind the strands to facilitate the advance of 
sequence-reading enzymes such as DNA polymerase. 
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DNA topology 



| Twist = -1 , Writhe = 0. (( )) Twist = +1 , Writhe = 0. 



Twist = 0, Writhe = -1. 



Twist = 0, Writhe = +1. 



\ Twist = -2, Writhe = 0. (( \\ Twist = +2, Writhe = 0. 



Twist = 0, Writhe = -2. 




Twist = 0, Writhe = +2. 



O. JO Twist = 0, Writhe = -4. 
Plectonemic Toroidal 

Supercoiled structure of circular DNA 

molecules with low writhe. Note that 

the helical nature of the DNA duplex is 

omitted for clarity. 



Within the cell most DNA is topologically restricted. 
DNA is typically found in closed loops (such as plasmids 
in prokaryotes) which are topologically closed, or as 
very long molecules whose diffusion coefficients 
produce effectively topologically closed domains. Linear 
sections of DNA are also commonly bound to proteins 
or physical structures (such as membranes) to form 
closed topological loops. 

Francis Crick was one of the first to propose the 
importance of linking numbers when considering DNA 
supercoils. In a paper published in 1976, Crick outlined 
the problem as follows: 

In considering supercoils formed by closed 
double-stranded molecules of DNA certain 
mathematical concepts, such as the linking 
number and the twist, are needed. The 
meaning of these for a closed ribbon is 
explained and also that of the writhing 
number of a closed curve. Some simple 
examples are given, some of which may be 
relevant to the structure of chromatin. ] 

Analysis of DNA topology uses three values: 

L = linking number - the number of times one DNA strand wraps around the other. It 
is an integer for a closed loop and constant for a closed topological domain. 

T = twist - total number of turns in the double stranded DNA helix. This will normally 
try to be equal to the number turns a DNA molecule will make while free in solution, 
ie. number of bases/10.4. 

W = writhe - number of turns of the double stranded DNA helix around the 
superhelical axis 

L = T + W and AL = AT + A W 

Any change of T in a closed topological domain must be balanced by a change in W, and 
vice versa. This results in higher order structure of DNA. A circular DNA molecule with a 
writhe of will be circular. If the twist of this molecule is subsequently increased or 
decreased by supercoiling then the writhe will be appropriately altered, making the 
molecule undergo plectonemic or toroidal superhelical coiling. 

When the ends of a piece of double stranded helical DNA are joined so that it forms a circle 
the strands are topologically knotted. This means the single strands cannot be separated 
any process that does not involve breaking a strand (such as heating). The task of 
un-knotting topologically linked strands of DNA falls to enzymes known as topoisomerases. 
These enzymes are dedicated to un-knotting circular DNA by cleaving one or both strands 
so that another double or single stranded segment can pass through. This un-knotting is 
required for the replication of circular DNA and various types of recombination in linear 
DNA which have similar topological constraints. 
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The linking number paradox 

For many years, the origin of residual supercoiling in eukaryotic genomes remained 
unclear. This topological puzzle was referred to by some as the "linking number 
paradox". J However, when experimentally determined structures of the nucleosome 
displayed an overtwisted left-handed wrap of DNA around the histone octamer ] [ 7] , this 
"paradox" was solved. 

See also 

• DNA nanotechnology 

• Molecular models of DNA 
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• MDDNA: Structural Bioinformatics of DNA (http://humphry.chem. wesleyan.edu:8080/ 
MDDNA/) 

• Ascalaph DNA (http://www.agilemolecule.com/Ascalaph/Ascalaph_DNA.html) — 
Commercial software for DNA modeling 

• DNAlive: a web interface to compute DNA physical properties (http://mmb.pcb.ub.es/ 
DNAlive). Also allows cross-linking of the results with the UCSC Genome browser and 
DNA dynamics. 

• DiProDB: Dinucleotide Property Database (http://diprodb.fli-leibniz.de). The database 
is designed to collect and analyse thermodynamic, structural and other dinucleotide 
properties. 
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Neutron scattering 

Neutron scattering encompasses all scientific techniques whereby the deflection of 
neutron radiation is used as a scientific probe. Neutrons readily interact with atomic nuclei 
and magnetic fields from unpaired electrons, making a useful probe of both structure and 
magnetic order. Neutron Scattering falls into two basic categories - elastic and inelastic. 
Elastic scattering is when a neutron interacts with a nucleus or electronic magnetic field 
but does not leave it in an excited state, meaning the emitted neutron has the same energy 
as the injected neutron. Scattering processes that involve an energetic excitation or 
relaxation by the neutron are inelastic: the injected neutron's energy is used or increased to 
create an excitation or by absorbing the excess energy from a relaxation, and consequently 
the emitted neutron's energy is reduced or increased respectively. 

For several good reasons, moderated neutrons provide an ideal tool for the study of almost 
all forms of condensed matter. Firstly, they are readily produced at a nuclear research 
reactor or a spallation source. Normally in such processes neutrons are however produced 
with much higher energies than are needed. Therefore moderators are generally used 
which slow the neutrons down and therefore produce wavelengths that are comparable to 
the atomic spacing in solids and liquids, and kinetic energies that are comparable to those 
of dynamic processes in materials. Moderators can be made from Aluminium and filled with 
liquid hydrogen (for very long wavelength neutrons) or liquid methane (for shorter 
wavelength neutrons). Fluxes of 10 7 /s - 10 8 /s are not atypical in most neutron sources from 
any given moderator. 

The neutrons cause pronounced interference and energy transfer effects in scattering 
experiments. Unlike an x-ray photon with a similar wavelength, which interacts with the 
electron cloud surrounding the nucleus, neutrons interact with the nucleus itself. Because 
the neutron is an electrically neutral particle, it is deeply penetrating, and is therefore more 
able to probe the bulk material. Consequently, it enables the use of a wide range of sample 
environments that are difficult to use with synchrotron x-ray sources. It also has the 
advantage that the cross sections for interaction do not increase with atomic number as 
they do with radiation from a synchrotron x-ray source. Thus neutrons can be used to 
analyse materials with low atomic numbers like proteins and surfactants. This can be done 
at synchrotron sources but very high intensities are needed which may cause the structures 
to change. Moreover, the nucleus provides a very short range, isotropic potential varying 
randomly from isotope to isotope, making it possible to tune the nuclear scattering contrast 
to suit the experiment: 

The neutron has an additional advantage over the x-ray photon in the study of condensed 
matter. It readily interacts with internal magnetic fields in the sample. In fact, the strength 
of the magnetic scattering signal is often very similar to that of the nuclear scattering 
signal in many materials, which allows the simultaneous exploration of both nuclear and 
magnetic structure. Because the neutron scattering amplitude can be measured in absolute 
units, both the structural and magnetic properties as measured by neutrons can be 
compared quantitatively with the results of other characterisation techniques. 
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See also 

• Neutron diffraction 

• Small angle neutron scattering 

• Neutron Reflectometry 

• Inelastic neutron scattering 

• neutron triple-axis spectrometry 

• neutron time-of-flight scattering 

• neutron backscattering 

• neutron spin echo 

• neutron resonance spin echo 

• Neutron scattering facilities 

External links 

• Neutron Scattering - A primer L J (LANL-hosted black and white version L J ) - An 
introductory article written by Roger Pynn (Los Alamos National Laboratory) 
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Paracrystalline lattices/Paracrystals 

Paracrystalline materials are defined as having short and medium range ordering in their 
lattice (similar to the liquid crystal phases) but lacking long-range ordering at least in one 
direction. 1] 

Ordering is the regularity in which atoms appear in a predictable lattice, as measured from 
one point. In a highly ordered, perfectly crystalline material, or single crystal, the location 
of every atom in the structure can be described exactly measuring out from a single origin. 
Conversely, in a disordered structure such as a liquid or amorphous solid, the location of 
the first and perhaps second nearest neighbors can be described from an origin (with some 
degree of uncertainty) and the ability to predict locations decreases rapidly from there out. 
The distance at which atom locations can be predicted is referred to as the correlation 
length £ . A paracrystalline material exhibits correlation somewhere between the fully 
amorphous and fully crystalline. 

The primary, most accessible source of crystallinity information is X-ray diffraction, 
although other techniques may be needed to observe the complex structure of 
paracrystalline materials, such as fluctuation electron microscopy [2] in combination with 
Density of states modeling^ J of electronic and vibrational states. 
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Paracrystalline Model 

The paracrystalline model is a revision of the Continuous Random Network model first 
proposed by W. H. Zachariasen in 1932 L J . The paracrystal model is defined as highly 
strained, microcrystalline grains surrounded by fully amorphous material [5] . This is a 
higher energy state then the continuous random network model. The important distinction 
between this model and the microcrystalline phases is the lack of defined grain boundaries 
and highly strained lattice parameters, which makes calculations of molecular and lattice 
dynamics difficult. A general theory of paracrystals has been formulated in a basic 
textbook^ J , and then further developed/refined by various authors. 

Applications 

The paracrystal model has been useful, for example, in describing the state of partially 
amorphous semiconductor materials after deposition. It has also been successfully applied 
to: synthetic polymers, liquid crystals, biopoloymers [7] , 8] and biomembranes [9] . 

See also 

• X-ray scattering 

• Amorphous solid 

• Single Crystal 

• Polycrystalline 

• Crystallography 

• DNA 

• X-ray pattern of a B-DNA Paracrystal [10] 
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2D-FT NMRI and Spectroscopy 

2D-FT Nuclear magnetic resonance imaging (2D-FT NMRI), or Two-dimensional 
Fourier transform nuclear magnetic resonance imaging (NMRI), is primarily a 
non— invasive imaging technique most commonly used in biomedical research and medical 
radiology/nuclear medicine/MRI to visualize structures and functions of the living systems 
and single cells. For example it can provides fairly detailed images of a human body in any 
selected cross-sectional plane, such as longitudinal, transversal, sagital, etc. The basic 
NMR phenomenon or physical principle^ J is essentially the same in N(MRI), nuclear 
magnetic resonance/FT (NMR) spectroscopy, topical NMR, or even in Electron Spin 
Resonance /EPR; however, the details are significantly different at present for EPR, as only 
in the early days of NMR the static magnetic field was scanned for obtaining spectra, as it 
is still the case in many EPR or ESR spectrometers. NMRI, on the other hand, often utilizes 
a linear magnetic field gradient to obtain an image that combines the visualization of 
molecular structure and dynamics. It is this dynamic aspect of NMRI, as well as its highest 
sensitivity for the X H nucleus that distinguishes it very dramatically from X-ray CAT 
scanning that 'misses' hydrogens because of their very low X-ray scattering factor. 

Thus, NMRI provides much greater contrast especially for the different soft tissues of the 
body than computed tomography (CT) as its most sensitive option observes the nuclear spin 
distribution and dynamics of highly mobile molecules that contain the naturally abundant, 
stable hydrogen isotope X H as in plasma water molecules, blood, disolved metabolites and 
fats. This approach makes it most useful in cardiovascular, oncological (cancer), 
neurological (brain), musculoskeletal, and cartilage imaging. Unlike CT, it uses no ionizing 
radiation, and also unlike nuclear imaging it does not employ any radioactive isotopes. 
Some of the first MRI images reported were published in 1973 [2] and the first study 
performed on a human took place on July 3, 1977. J Earlier papers were also published by 
Sir Peter Mansfield [4] in UK (Nobel Laureate in 2003), and R. Damadian in the USA [5] , 
(together with an approved patent for 'fonar', or magnetic imaging). The detailed physical 
theory of NMRI was published by Peter Mansfield in 1973 [6] . Unpublished 'high-resolution' 
(50 micron resolution) images of other living systems, such as hydrated wheat grains, were 
also obtained and communicated in UK in 1977-1979, and were subsequently confirmed by 
articles published in Nature by Peter Callaghan. 
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NMR Principle 

Certain nuclei such as H 
nuclei, or "fermions 1 have 
spin- 1/2, because there are 
two spin states, referred to as 
"up" and "down" states. The 
nuclear magnetic resonance 
absorption phenomenon 

occurs when samples 

containing such nuclear spins 
are placed in a static magnetic 
field and a very short 
radiofrequency pulse is 
applied with a center, or 
carrier, frequency matching 
that of the transition between 
the up and down states of the 
spin- 1/2 H nuclei that were polarized by the static magnetic field. L J Very low field 




Advanced 4.7 T clinical diagnostics and biomedical research NMR 
Imaging instrument. 



schemes have also been recently reported 



[8] 



Chemical Shifts 

NMR is a very useful family of techniques for chemical and biochemical research because 
of the chemical shift; this effect consists in a frequency shift of the nuclear magnetic 
resonance for specific chemical groups or atoms as a result of the partial shielding of the 
corresponding nuclei from the applied, static external magnetic field by the electron 
orbitals (or molecular orbitals) surrounding such nuclei present in the chemical groups. 
Thus, the higher the electron density surrounding a specific nucleus the larger the chemical 
shift will be. The resulting magnetic field at the nucleus is thus lower than the applied 
external magnetic field and the resonance frequencies observed as a result of such 
shielding are lower than the value that would be observed in the absence of any electronic 
orbital shielding. Furthermore, in order to obtain a chemical shift value independent of the 
strength of the applied magnetic field and allow for the direct comparison of spectra 
obtained at different magnetic field values, the chemical shift is defined by the ratio of the 
strength of the local magnetic field value at the observed (electron orbital-shielded) nucleus 
by the external magnetic field strength, H / H . The first NMR observations of the 

lOC U . q 

chemical shift, with the correct physical chemistry interpretation, were reported for F 
containing compounds in the early 1950s by Herbert S. Gutowsky and Charles P. Slichter 
from the University of Illinois at Urbana (USA). 

A related effect in metals is called the Knight shift, which is due only to the conduction 
electrons. Such conduction electrons present in metals induce an "additional" local field at 
the nuclear site, due to the spin re-orientation of the conduction electrons in the presence 
of the applied (constant), external magnetic field. This is only broadly "similar 1 to the 
chemical shift in either solutions or diamagnetic solids. 
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NMR Imaging Principles 

A number of methods have been devised for combining magnetic field gradients and 
radiofrequency pulsed excitation to obtain an image. Two major maethods involve either 2D 
-FT or 3D-FT [ ] reconstruction from projections, somewhat similar to Computed 
Tomography, with the exception of the image interpretation that in the former case must 
include dynamic and relaxation/contrast enhancement information as well. Other schemes 
involve building the NMR image either point-by-point or line-by-line. Some schemes use 
instead gradients in the rf field rather than in the static magnetic field. The majority of 
NMR images routinely obtained are either by the Two-Dimensional Fourier Transform 
(2D-FT) technique [ ] (with slice selection), or by the Three-Dimensional Fourier Transform 
(3D— FT) techniques that are however much more time consuming at present. 2D-FT NMRI 
is sometime called in common parlance a "spin-warp". An NMR image corresponds to a 
spectrum consisting of a number of "spatial frequencies' at different locations in the sample 
investigated, or in a patient. 1 J A two-dimensional Fourier transformation of such a "real" 
image may be considered as a representation of such "real waves" by a matrix of spatial 
frequencies known as the k-space. We shall see next in some mathematical detail how the 
2D-FT computation works to obtain 2D-FT NMR images. 

Two-dimensional Fourier transform imaging and 
spectroscopy 

A two-dimensional Fourier transform (2D-FT) is computed numerically or carried out in two 
stages, both involving "standard 1 , one-dimensional Fourier transforms. However, the 
second stage Fourier transform is not the inverse Fourier transform (which would result in 
the original function that was transformed at the first stage), but a Fourier transform in a 
second variable— which is N shifted' in value— relative to that involved in the result of the 
first Fourier transform. Such 2D-FT analysis is a very powerful method for both NMRI and 
two-dimensional nuclear magnetic resonance spectroscopy (2D-FT NMRS) [ ] that allows 
the three-dimensional reconstruction of polymer and biopolymer structures at atomic 
resolution. ] for molecular weights (Mw) of dissolved biopolymers in aqueous solutions 
(for example) up to about 50,000 Mw. For larger biopolymers or polymers, more complex 
methods have been developed to obtain limited structural resolution needed for partial 
3D-reconstructions of higher molecular structures, e.g. for up 900,000 Mw or even oriented 
microcrystals in aqueous suspensions or single crystals; such methods have also been 
reported for in vivo 2D-FT NMR spectroscopic studies of algae, bacteria, yeast and certain 
mammalian cells, including human ones. The 2D-FT method is also widely utilized in optical 
spectroscopy, such as 2D-FT NIR hyperspectral imaging (2D-FT NIR-HS), or in MRI 
imaging for research and clinical, diagnostic applications in Medicine. In the latter case, 
2D-FT NIR-HS has recently allowed the identification of single, malignant cancer cells 
surrounded by healthy human breast tissue at about 1 micron resolution, well-beyond the 
resolution obtainable by 2D-FT NMRI for such systems in the limited time available for such 
diagnostic investigations (and also in magnetic fields up to the FDA approved magnetic 
field strength H of 4.7 T, as shown in the top image of the state-of-the-art NMRI 
instrument). A more precise mathematical definition of the "double 1 (2D) Fourier transform 
involved in both 2D NMRI and 2D-FT NMRS is specified next, and a precise example 
follows this generally accepted definition. 
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2D-FT Definition 

A 2D-FT, or two-dimensional Fourier transform, is a standard Fourier transformation of a 
function of two variables, f(x x ), carried first in the first variable x followed by the 
Fourier transform in the second variable x of the resulting function F(s ,x ). Note that in 
the case of both 2D-FT NMRI and 2D-FT NMRS the two independent variables in this 
definition are in the time domain, whereas the results of the two successive Fourier 
transforms have, of course, frequencies as the independent variable in the NMRS, and 
ultimately spatial coordinates for both 2D NMRI and 2D-FT NMRS following computer 
structural recontructions based on special algorithms that are different from FT or 2D-FT. 
Moreover, such structural algorithms are different for 2D NMRI and 2D-FT NMRS: in the 
former case they involve macroscopic, or anatomical structure detrmination, whereas in the 
latter case of 2D-FT NMRS the atomic structure reconstruction algorithms are based on the 
quantum theory of a microphysical (quantum) process such as nuclear Overhauser 
enhancement NOE, or specific magnetic dipole-dipole interactions^ J between neighbor 
nuclei. 

Example 1 

A 2D Fourier transformation and phase correction is applied to a set of 2D NMR (FID) 
signals: s(t ,t ) yielding a real 2D-FT NMR "spectrum 1 (collection of ID FT-NMR spectra) 
represented by a matrix S whose elements are 

S (yuvi) = Re / /ro5(^i^i)exp c "" /2£2) 5(ti ; t 2 )^i^ 2 

where : ^land : Vf i denote the discrete indirect double-quantum and 
single-quantum(detection) axes, respectively, in the 2D NMR experiments. Next, the 
covariance matrix is calculated in the frequency domain according to the following equation 

c (u'„u 2 )=S T S = £[S(*iX)SK* 2 )], with . ^taking all possible 

single-quantum frequency values and with the summation carried out over all discrete, 
double quantum frequencies : ]/ \ . 

Example 2 

Atomic Structure from 2D-FT STEM Images [ 5] of electron distributions in a 
high-temperature cuprate superconductor "paracrystal 1 reveal both the domains (or 
"location 1 ) and the local symmetry of the 'pseudo-gap' in the electron-pair correlation band 
responsible for the high— temperature superconductivity effect (obtained at Cornell 
University). So far there have been three Nobel prizes awarded for 2D-FT NMR/MRI during 
1992-2003, and an additional, earlier Nobel prize for 2D-FT of X-ray data ("CAT scans'); 
recently the advanced possibilities of 2D-FT techniques in Chemistry, Physiology and 
Medicine L J received very significant recognition. J 

Brief explanation of NMRI diagnostic uses in Pathology 

As an example, a diseased tissue such as a malign tumor, can be detected by 2D-FT NMRI 
because the hydrogen nuclei of molecules in different tissues return to their equilibrium 
spin state at different relaxation rates, and also because of the manner in which a malign 
tumor spreads and grows rapidly along the blood vessels adjacent to the tumor, also 
inducing further vascularization to occur. By changing the pulse delays in the RF pulse 
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sequence employed, and/or the RF pulse sequence itself, one may obtain a 
N relaxation— based contrast 1 , or contrast enhancement between different types of body 
tissue, such as normal vs. diseased tissue cells for example. Excluded from such diagnostic 
observations by NMRI are all patients with ferromagnetic metal implants, (e.g., cochlear 
implants), and all cardiac pacemaker patients who cannot undergo any NMRI scan because 
of the very intense magnetic and RF fields employed in NMRI which would strongly 
interfere with the correct functioning of such pacemakers. It is, however, conceivable that 
future developments may also include along with the NMRI diagnostic treatments with 
special techniques involving applied magnetic fields and very high frequency RF. Already, 
surgery with special tools is being experimented on in the presence of NMR imaging of 
subjects.Thus, NMRI is used to image almost every part of the body, and is especially useful 
for diagnosis in neurological conditions, disorders of the muscles and joints, for evaluating 
tumors, such as in lung or skin cancers, abnormalities in the heart (especially in children 
with hereditary disorders), blood vessels, CAD, atherosclerosis and cardiac infarcts L J 
(courtesy of Dr. Robert R. Edelman) 

See also 
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This article incorporates material by the original author from 2D-FT MR- Imaging and 
related Nobel awards (http://planetphysics.org/encyclopedia/2DFTImaging.html) on 
PlanetPhysics (http://planetphysics.org/), which is licensed under the GFDL. 
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FRET and FCS-Fluorescence 
correlation spectroscopy 



Forster resonance energy transfer (abbreviated 
FRET), also known as fluorescence resonance 
energy transfer, resonance energy transfer 
(RET) or electronic energy transfer (EET), is a 
mechanism describing energy transfer between two 
chromophores. 

A donor chromophore, intially in its electronic 
excited state, may transfer energy to an acceptor 
chromophore (in close proximity, typically <10 nm) 
through nonradiative dipole-dipole coupling. This 
mechanism is termed "Forster resonance energy 
transfer" and is named after the German scientist 
Theodor Forster. J When both chromophores are 
fluorescent, the term "fluorescence resonance 
energy transfer" is often used instead, although the 
energy is not actually transferred by fluorescence. ] 

,[4] 




Fluorescently-labeled guanosine 

5'-triphosphate hydrolase ARF reveals the 

protein's localization in the Golgi apparatus 

of a living macrophage. FRET studies 
revealed ARF activation in the Golgi and in 

./hi 



the formation of phagosomes. 



In order to avoid an erroneous interpretation of 
the phenomenon that (even when occurring 

between two fluorescent chromophores) is always a nonradiative transfer of energy, the 
name "Forster resonance energy transfer" is preferred to "fluorescence resonance energy 
transfer" - although the latter enjoys common usage in scientific literature. FRET is 
analogous to Near Field Communication, in that the radius of interaction is much smaller 
than the wavelength of light emitted. In the near field region, the excited chromophore 
emits a virtual photon that is instantly absorbed by a receiving chromophore. These virtual 
photons are undetectable, since their existence violates the conservation of energy and 
momentum, and hence FRET is known as a radiationless mechanism. From quantum 
electrodynamical calculations, it is determined that radiationless (FRET) and radiative 
energy transfer are the short- and long-range asymptotes of a single unified mechanism. ] 

,[6] 



Theoretical basis 

The FRET efficiency ( E ) is the quantum yield of the energy transfer transition, i.e. the 
fraction of energy transfer event occurring per donor excitation event: 

" kf + kvr + Uki 

where fc^ris the rate of energy transfer, ^/the radiative decay rate and the k;are the 

rate constants of any other de-excitation pathway. 

The FRET efficiency depends on many parameters that can be grouped as follows: 

• The distance between the donor and the acceptor 
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• The spectral overlap of the donor emission spectrum and the acceptor absorption 
spectrum. 

• The relative orientation of the donor emission dipole moment and the acceptor 
absorption dipole moment. 

E depends on the donor-to-acceptor separation distance r with an inverse 6th power law 
due to the dipole-dipole coupling mechanism: 

F _ i 

"~1 + (r/flo)' 
with /Jobeing the Forster distance of this pair of donor and acceptor i.e. the distance at 
which the energy transfer efficiency is 50%. The Forster distance depends on the overlap 
integral of the donor emission spectrum with the acceptor absorption spectrum and their 
mutual molecular orientation as expressed by the following equation: 

6 9Q (lnl0)^J 
" 12S<K 5 n*N A 
where Quis the fluorescence quantum yield of the donor in the absence of the acceptor, ^ 2 
is the dipole orientation factor, n is the refractive index of the medium, N^ is Avogadro's 
number, and J is the spectral overlap integral calculated as 

J = J f B (X)e A (X)X 4 dX 

where /bis the normalized donor emission spectrum, and E Ais the acceptor molar 
extinction coefficient, k 2 = 2/3 is often assumed. This value is obtained when both dyes are 
freely rotating and can be considered to be isotropically oriented during the excited state 
lifetime. If either dye is fixed or not free to rotate, then k 2 = 2/3 will not be a valid 
assumption. In most cases, however, even modest reorientation of the dyes results in 
enough orientational averaging that k 2 = 2/3 does not result in a large error in the 
estimated energy transfer distance due to the sixth power dependence of R on k . Even 
when k is quite different from 2/3 the error can be associated with a shift in R and thus 
determinations of changes in relative distance for a particular system are still valid. 
Fluorescent proteins do not reorient on a timescale that is faster than their fluorescence 
lifetime. In this case < k < 4. 

The FRET efficiency relates to the quantum yield and the fluorescence lifetime of the donor 
molecule as follows: 

E = 1 — T^/ru 

where T D and r oare the donor fluorescence lifetimes in the presence and absence of an 
acceptor, respectively, or as 
E = 1 - F^/Fj, 

where F D and fbare the donor fluorescence intensities with and without an acceptor, 
respectively. 
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Methods 

In fluorescence microscopy, fluorescence confocal laser 
scanning microscopy, as well as in molecular biology, 
FRET is a useful tool to quantify molecular dynamics in 
biophysics and biochemistry, such as protein-protein 
interactions, protein-DNA interactions, and protein 
conformational changes. For monitoring the complex 
formation between two molecules, one of them is 
labeled with a donor and the other with an acceptor, 
and these fluorophore-labeled molecules are mixed. 
When they are dissociated, the donor emission is 
detected upon the donor excitation. On the other hand, 
when the donor and acceptor are in proximity (1-10 nm) 
due to the interaction of the two molecules, the 
acceptor emission is predominantly observed because 
of the intermolecular FRET from the donor to the 
acceptor. For monitoring protein conformational 
changes, the target protein is labeled with a donor and 
an acceptor at two loci. When a twist or bend of the 
protein brings the change in the distance or relative 
orientation of the donor and acceptor, FRET change is 
observed. If a molecular interaction or a protein 
conformational change is dependent on ligand binding, 
this FRET technique is applicable to fluorescent 
indicators for the ligand detection. 
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Example of FRET between CFP and 

YFP (Wavelength vs. Absorption): a 

fusion protein containing CFP and YFP 

excited at 440nm wavelength. The 

fluorescent emission peak of CFP 

overlaps the excitation peak of YFP. 

Because the two proteins are adjacent 

to each other, the energy transfer is 

significant-a large proportion of the 

energy from CFP is transferred to YFP 

and creates a much larger YFP 

emission peak. 



FRET studies are scalable: the extent of energy transfer 
is often quantified from the milliliter scale of 
cuvette-based experiments to the femtoliter scale of 
microscopy-based experiments. This quantification can 

be based directly (sensitized emission method) on detecting two emission channels under 
two different excitation conditions (primarily donor and primarily acceptor). However, for 
robustness reasons, FRET quantification is most often based on measuring changes in 
fluorescence intensity or fluorescence lifetime upon changing the experimental conditions 
(e.g. a microscope image of donor emission is taken with the acceptor being present. The 
acceptor is then bleached, such that it is incapable of accepting energy transfer and 
another donor emission image is acquired. A pixel-based quantification using the second 
equation in the theory section above is then possible.) An alternative way of temporarily 
deactivating the acceptor is based on its fluorescence saturation. Exploiting polarisation 
characteristics of light, a FRET quantification is also possible with only a single camera 
exposure. 
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CFP-YFP pairs 

The most popular FRET pair for biological use is a cyan fluorescent protein (CFP)-yellow 
fluorescent protein (YFP) pair. Both are color variants of green fluorescent protein (GFP). 
While labeling with organic fluorescent dyes requires troublesome processes of 
purification, chemical modification, and intracellular injection of a host protein, GFP 
variants can be easily attached to a host protein by genetic engineering. By virtue of GFP 
variants, the use of FRET techniques for biological research is becoming more and more 
popular. 

BRET 

A limitation of FRET is the requirement for external illumination to initiate the fluorescence 
transfer, which can lead to background noise in the results from direct excitation of the 
acceptor or to photobleaching. To avoid this drawback, Bioluminescence Resonance Energy 
Transfer (or BRET) has been developed. This technique uses a bioluminescent luciferase 
(typically the luciferase from Renilla reniformis) rather than CFP to produce an initial 
photon emission compatible with YFP. 

FRET and BRET are also the common tools in the study of biochemical reaction kinetics and 
molecular motors. 

Photobleaching FRET 

FRET efficiencies can also be inferred from the photobleaching rates of the donor in the 
presence and absence of an acceptor. This method can be performed on most fluorescence 
microscopes; one simply shines the excitation light (of a frequency that will excite the donor 
but not the acceptor significantly) on specimens with and without the acceptor fluorophore 
and monitors the donor fluorescence (typically separated from acceptor fluorescence using 
a bandpass filter) over time. The timescale is that of photobleaching, which is seconds to 
minutes, with fluorescence in each curve being given by 

(background) + (constant) * e -< time )/ T pb 

where r P b is the photobleaching decay time constant and depends on whether the acceptor 
is present or not. Since photobleaching consists in the permanent inactivation of excited 
fluorophores, resonance energy transfer from an excited donor to an acceptor fluorophore 
prevents the photobleaching of that donor fluorophore, and thus high FRET efficiency leads 
to a longer photobleaching decay time constant: 

E = 1 - T pb /Vp b 

where ^and ^p^are the photobleaching decay time constants of the donor in the 
presence and in the absence of the acceptor, respectively. (Notice that the fraction is the 
reciprocal of that used for lifetime measurements). 

This technique was introduced by Jovin in 1989. L J Its use of an entire curve of points to 
extract the time constants can give it accuracy advantages over the other methods. Also, 
the fact that time measurements are over seconds rather than nanoseconds makes it easier 
than fluorescence lifetime measurements, and because photobleaching decay rates do not 
generally depend on donor concentration (unless acceptor saturation is an issue), the 
careful control of concentrations needed for intensity measurements is not needed. It is, 
however, important to keep the illumination the same for the with- and without-acceptor 
measurements, as photobleaching increases markedly with more intense incident light. 
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Other methods 

A different, but related, mechanism is Dexter Electron Transfer. 

An alternative method to detecting protein-protein proximity is BiFC where two halves of a 
YFP are fused to a protein (Hu, Kerppola et al. 2002). When these two halves meet they 
form a fluorophore after about 60 s - 1 hr. 



Applications 



FRET has been applied in an experimental method for the detection of phosgene. In it, 
phosgene or rather triphosgene as a safe substitute serves as a linker between an acceptor 
and a donor coumarine (forming urea groups). ] The presence of phosgene is detected at 
5xlO" 5 M with a typical FRET emission at 464 nm. 

o 
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MISTAKE: The chromophore on the right must be also coumarine (double bond is missing) 



FRET is also used to study lipid rafts in cell membranes. 



[9] 



External links 

• Browser-based calculator to find the critical distance and FRET efficiency with known 
spectral overlap [ ] 



FCS 
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DNA Molecular dynamics modeling involves simulations of DNA molecular geometry 
and topology changes with time as a result of both intra- and inter- molecular interactions 
of DNA. Whereas molecular models of Deoxyribonucleic acid (DNA) molecules such as 
closely packed spheres (CPK models) made of plastic or metal wires for 'skeletal models' 
are useful representations of static DNA structures, their usefulness is very limited for 
representing complex DNA dynamics. Computer molecular modeling allows both 
animations and molecular dynamics simulations that are very important for understanding 
how DNA functions in vivo. 

An old standing dynamic problem is how DNA "self-replication" takes place in living cells 
that should involve transient uncoiling of supercoiled DNA fibers. Although DNA consists of 
relatively rigid, very large elongated biopolymer molecules called "fibers" or chains its 
molecular structure in vivo undergoes dynamic configuration changes that involve 
dynamically attached water molecules, ions or proteins/enzymes. Supercoiling, packing 
with histones in chromosome structures, and other such supramolecular aspects also 
involve in vivo DNA topology which is even more complex than DNA molecular geometry, 
thus turning molecular modeling of DNA dynamics into a series of challenging problems for 
biophysical chemists, molecular biologists and biotechnologists. Thus, DNA exists in 
multiple stable geometries (called conformational isomerism) and has a rather large 
number of configurational, quantum states which are close to each other in energy on the 
potential energy surface of the DNA molecule. 



DNA Dynamics 198 

Such varying molecular geometries can also be computed, at least in principle, by 
employing ab initio quantum chemistry methods that can attain high accuracy for small 
molecules, although claims that acceptable accuracy can be also achieved for 
polynucleotides, as well as DNA conformations, were recently made on the basis of VCD 
spectral data. Such quantum geometries define an important class of ab initio molecular 
models of DNA whose exploration has barely started especially in connection with results 
obtained by VCD in solutions. More detailed comparisons with such ab initio quantum 
computations are in principle obtainable through 2D-FT NMR spectroscopy and relaxation 
studies of polynucleotide solutions or specifically labeled DNA, as for example with 
deuterium labels. 

Importance of DNA molecular structure and dynamics 
modeling for Genomics and beyond 

From the very early stages of structural studies of DNA by X-ray diffraction and 
biochemical means, molecular models such as the Watson-Crick double-helix model were 
successfully employed to solve the 'puzzle' of DNA structure, and also find how the latter 
relates to its key functions in living cells. The first high quality X-ray diffraction patterns of 
A-DNA were reported by Rosalind Franklin and Raymond Gosling in 1953 [1] . The first 
reports of a double-helix molecular model of B-DNA structure were made by Watson and 
Crick in 1953 [2] [3] . Then Maurice F. Wilkins, A. Stokes and H.R. Wilson, reported the first 
X-ray patterns of in vivo B-DNA in partially oriented salmon sperm heads L J . The 
development of the first correct double-helix molecular model of DNA by Crick and Watson 
may not have been possible without the biochemical evidence for the nucleotide 
base-pairing ([A— T] ; [C-G]), or Chargaff's rules [5] [6] [7] [8] [9] [10] . Although such initial 
studies of DNA structures with the help of molecular models were essentially static, their 
consequences for explaining the in vivo functions of DNA were significant in the areas of 
protein biosynthesis and the quasi-universality of the genetic code. Epigenetic 
transformation studies of DNA in vivo were however much slower to develop in spite of 
their importance for embryology, morphogenesis and cancer research. Such chemical 
dynamics and biochemical reactions of DNA are much more complex than the molecular 
dynamics of DNA physical interactions with water, ions and proteins/enzymes in living cells. 

Animated DNA molecular models and hydrogen-bonding 

Animated molecular models allow one to visually explore the three-dimensional (3D) 
structure of DNA. The first DNA model is a space-filling, or CPK, model of the DNA 
double-helix whereas the third is an animated wire, or skeletal type, molecular model of 
DNA. The last two DNA molecular models in this series depict quadruplex DNA [ ] that 
may be involved in certain cancers^ J L . The first CPK model in the second row is a 
molecular model of hydrogen bonds between water molecules in ice that are broadly similar 
to those found in DNA; the hydrogen bonding dynamics and proton exchange is however 
very different by many orders of magnitude between the two systems of fully hydrated DNA 
and water molecules in ice. Thus, the DNA dynamics is complex, involving nanosecond and 
several tens of picosecond time scales, whereas that of liquid ice is on the picosecond time 
scale, and that of proton exchange in ice is on the millisecond time scale; the proton 
exchange rates in DNA and attached proteins may vary from picosecond to nanosecond, 
minutes or years, depending on the exact locations of the exchanged protons in the large 
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biopolymers. The simple harmonic oscillator 'vibration' in the third, animated image of the 
next gallery is only an oversimplified dynamic representation of the longitudinal vibrations 
of the DNA intertwined helices which were found to be anharmonic rather than harmonic as 
often assumed in quantum dynamic simulations of DNA. 
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Human Genomics and Biotechnology Applications of DNA 
Molecular Modeling 

The following two galleries of images illustrate various uses of DNA molecular modeling in 
Genomics and Biotechnology research applications from DNA repair to PCR and DNA 
nanostructures; each slide contains its own explanation and/or details. The first slide 
presents an overview of DNA applications, including DNA molecular models, with emphasis 
on Genomics and Biotechnology. 

Applications of DNA molecular dynamics computations 

• First row images present a DNA biochip and DNA nanostructures designed for DNA 
computing and other dynamic applications of DNA nanotechnology; last image in this row 
is of DNA arrays that display a representation of the Sierpinski gasket on their surfaces. 

• Second row: the first two images show computer molecular models of RNA polymerase, 
followed by that of an E. coli, bacterial DNA primase template suggesting very complex 
dynamics at the interfaces between the enzymes and the DNA template; the fourth image 
illustrates in a computed molecular model the mutagenic, chemical interaction of a 
potent carcinogen molecule with DNA, and the last image shows the different 
interactions of specific fluorescence labels with DNA in human and orangoutan 
chromosomes. 
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Image Gallery: DNA Applications and Technologies at various scales 
in Biotechnology and Genomics research 

The first figure is an actual electron micrograph of a DNA fiber bundle, presumably of a 
single plasmid, bacterial DNA loop. 
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Databases for Genomics, DNA Dynamics and Sequencing 



Genomic and structural databases 

• CBS Genome Atlas Database L J — contains examples of base skews. J 

• The Z curve database of genomes — a 3-dimensional visualization and analysis tool of 
genomes [59][14] . 

• DNA and other nucleic acids' molecular models: Coordinate files of nucleic acids 
molecular structure models in PDB and CIF formats L J 
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DNA Dynamics Data from Spectroscopy 

• FT-NMR [15] [16] 

• NMR Atlas-database [29] 

• mmcif downloadable coordinate files of nucleic acids in solution from 2D-FT NMR data 

[30] 

• NMR constraints files for NAs in PDB format [31] 

ri7i 
NMR microscopy 1 J 

Vibrational circular dichroism (VCD) 

Microwave spectroscopy 

FT-IR 

p T _ NIR [18] [19] [20] 

Spectral, Hyperspectral, and Chemical imaging) [21] [22] [23] [24] [25] [26] [27] . 

Raman spectroscopy/microscopy [ ] and CARS [ ] . 

Fluorescence correlation spectroscopy [30] [31] [32] [33] [34] [35] [36] [37] , Fluorescence 

cross-correlation spectroscopy and FRET [ ] [ ] [ . 

Confocal microscopy^ J 
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Gallery: CARS (Raman spectroscopy), Fluorescence confocal 
microscopy, and Hyperspectral imaging 
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X-ray microscopy 

• Application of X-ray microscopy in the analysis of living hydrated cells L J 

Atomic Force Microscopy (AFM) 

Two-dimensional DNA junction arrays have been visualized by Atomic Force Microscopy 
(AFM) [42] . Other imaging resources for AFM/Scanning probe microscopy(SPM) can be 
freely accessed at: 

• How SPM Works [25] 

• SPM Image Gallery - AFM STM SEM MFM NSOM and more. [26] 

Gallery of AFM Images of DNA Nanostructures 
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Molecular modeling of DNA 

Genomics 

Signal transduction 

Transcriptomics 
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Biotechnology 
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MAYA-II 
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DNA structure 

Molecular structure 

Molecular dynamics 

Molecular topology 

DNA topology 

DNA, the Genome and Interactome 

Molecular structure 

Molecular geometry fluctuations 

Molecular interactions 

Molecular topology 

Hydrogen bonding 

Hydrophobic interactions 

DNA dynamics and conformations 

DNA Conformational isomerism 

2D-FT NMRI and Spectroscopy 

Paracrystalline lattices/Paracrystals 

NMR Spectroscopy 

VCD or Vibrational circular dichroism 

Microwave spectroscopy 

Two-dimensional IR spectroscopy 

FRET and FCS- Fluorescence correlation spectroscopy 

Fluorescence cross-correlation spectroscopy (FCCS) 

Spectral imaging 

Hyperspectral imaging 

Chemical imaging 

NMR microscopy 

X-ray scattering 

Neutron scattering 

Crystallography 

Crystal lattices 

Molecular geometry 

Nanostructure 

DNA nanotechnology 

Imaging 

Sirius visualization software 
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Atomic force microscopy 

X-ray microscopy 

Liquid crystals 

Glasses 

QMC@Home 

Sir Lawrence Bragg, FRS 

Sir John Randall 

Francis Crick 

Manfred Eigen 

Felix Bloch 

Paul Lauterbur 

Maurice Wilkins 

Herbert Wilson, FRS 

Alex Stokes 

External links 

DNAlive: a web interface to compute DNA physical properties (http://mmb.pcb.ub.es/ 

DNAlive). Also allows cross-linking of the results with the UCSC Genome browser and 

DNA dynamics. 

Application of X-ray microscopy in analysis of living hydrated cells (http://www.ncbi. 

nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=pubmed&dopt=Abstract& 

list_uids=12379938) 

DiProDB: Dinucleotide Property Database (http://diprodb.fli-leibniz.de). The database 

is designed to collect and analyse thermodynamic, structural and other dinucleotide 

properties. 

DNA the Double Helix Game (http://nobelprize.org/educational_games/medicine/ 

dnadoublehelix/) From the official Nobel Prize web site 

MDDNA: Structural Bioinformatics of DNA (http://humphry.chem. wesleyan.edu:8080/ 

MDDNA/) 

Double Helix 1953-2003 (http://www.ncbe.reading.ac.uk/DNA50/) National Centre 

for Biotechnology Education 

DNA under electron microscope (http://www.fidelitysystems.com/Unlinked_DNA. 

html) 

Further details of mathematical and molecular analysis of DNA structure based on X-ray 

data (http ://planetphysics . org/encyclopedia/ 

BesselFunctionsApplicationsToDiffractionByHelicalStructures.html) 

Bessel functions corresponding to Fourier transforms of atomic or molecular helices. 

(http://planetphysics.org/?op=getobj&from=objects& 

name=BesselFunctionsAndTheirApplicationsToDiffractionByHelicalStructures) 

Characterization in nanotechnology some pdfs (http://nanocharacterization.sitesled. 

com/) 

An overview of STM/AFM/SNOM principles with educative videos (http://www.ntmdt. 

ru/SPM-Techniques/Principles/) 

SPM Image Gallery - AFM STM SEM MFM NSOM and More (http://www.rhk-tech.com/ 

results/showcase. php) 

How SPM Works (http://www.parkafm.com/New_html/resources/01general.php) 
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• U.S. National DNA Day (http://www.genome.gov/10506367) — watch videos and 
participate in real-time discussions with scientists. 

• The Secret Life of DNA - DNA Music compositions (http://www.tjmitchell.com/stuart/ 
dna.html) 

• Ascalaph DNA (http://www.agilemolecule.com/Ascalaph/Ascalaph_DNA.html) — 
Commercial software for DNA modeling 
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Part of a series of articles on 

Molecular self-assembly 



Self-assembled monolayer 

Supramolecular assembly 

DNA nanotechnology 



See also 
Nanotechnology 

DNA nanotechnology is a subfield of nanotechnology which seeks to use the unique 
molecular recognition properties of DNA and other nucleic acids to create novel, 
controllable structures out of DNA. The DNA is thus used as a structural material rather 
than as a carrier of genetic information, making it an example of bionanotechnology. This 
has possible applications in molecular self-assembly and in DNA computing. 

Introduction: DNA crossover molecules 
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Structure of the 4-arm junction. 

Left: A schematic. Right: A more realistic model. J 

Each of the four separate DNA single strands are shown in different colors. 
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DNA nanotechnology makes use of branched DNA structures to 
create DNA complexes with useful properties. DNA is normally a 
linear molecule, in that its axis is unbranched. However, DNA 
molecules containing junctions can also be made. For example, a 
four-arm junction can be made using four individual DNA strands 
which are complementary to each other in the correct pattern. Due to 
Watson-Crick base pairing, only portions of the strands which are 
complementary to each other will attach to each other to form duplex 
DNA. This four-arm junction is an immobile form of a Holliday 
junction. 

Junctions can be used in more complex molecules. The most 
important of these is the "double-crossover" or DX motif. Here, two 
DNA duplexes lie next to each other, and share two junction points 
where strands cross from one duplex into the other. This molecule 
has the advantage that the junction points are now constrained to a 
single orientation as opposed to being flexible as in the four-arm 
junction. This makes the DX motif suitible as a structural building 
block for larger DNA complexes. J 




A double-crossover 
(DX) molecule. This 
molecule consists of 

five DNA single 

strands which form 

two double-helical 

domains, on the left 

and the right in this 

image. There are two 

crossover points 

where the strands 

cross from one 

domain into the 
other. Image from 

Mao, 2004. [2] 
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Tile-based arrays 




Assembly of a DX array. Each bar 

represents a double-helical domain of DNA, 

with the shapes representing comlimentary 

sticky ends. The DX molecule at top will 

combine into the two-dimensional DNA 

array shown at bottom. Image from Mao, 

2004. [2] 



DX arrays 

DX, Double Crossover, molecules can be equipped 
with sticky ends in order to combine them into a 
two-dimenstional periodic lattice. Each DX molecule 
has four termini, one at each end of the two 
double-helical domains, and these can be equipped 
with sticky ends that program them to combine into 
a specific pattern. More than one type of DX can be 
used which can be made to arrange in rows or any 
other tessellated pattern. They thus form extended 
flat sheets which are essentially two-dimensional 
crystals of DNA. [4] 

DNA nanotubes 



In addition to flat sheets, DX arrays have been made 

to form hollow tubes of 4-20 nm diameter. These 

DNA nanotubes are somewhat similar in size and shape to carbon nanotubes, but the 

carbon nanotubes are stronger and better conductors, whereas the DNA nanotubes are 

more easily modified and connected to other structures. J 



Other tile arrays 

Two-dimensional arrays have been made out of other motifs as well, including the Holliday 
junction rhombus array as well as various DX-based arrays in the shapes of triangles and 
hexagons. J Another motif, the six-helix bundle, has the ability to form three-dimensional 
DNA arrays as well. ] 

DNA origami 

As an alternative to the tile-based approach, two-dimensional DNA structures can be made 
from a single, long DNA strand of arbitrary sequence which is folded into the desired shape 
by using shorter, "staple" strands. This allows the creation of two-dimensional shapes at the 
nanoscale using DNA. Demonstrated designs have included the smiley face and a coarse 
map of North America. DNA origami was the cover story of Nature on March 15, 2006. ] 



DNA polyhedra 

A number of three-dimensional DNA molecules have been made which have the 
connectivity of a polyhedron such as an octahedron or cube. In other words, the DNA 
duplexes trace the edges of a polyhedron with a DNA junction at each vertex. The earliest 
demonstrations of DNA polyhedra involved multiple ligations and solid-phase synthesis 
steps to create catenated polyhedra. More recently, there have been demonstrations of a 
DNA truncated octahedron made from a long single strand designed to fold into the correct 
conformation, as well as a tetrahedron which can be produced from four DNA strands in a 
single step. ] 
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DNA nanomechanical devices 

DNA complexes have been made which change their conformation upon some stimulus. 
These are intended to have applications in nanorobotics. One of the first such devices, 
called "molecular tweezers," changes from an open to a closed state based upon the 
presence of control strands. 

DNA machines have also been made which show a twisting motion. One of these makes use 
of the transition between the B-DNA and Z-DNA forms to respond to a change in buffer 
conditions. Another relies on the presence of control strands to switch from a 
paranemic-crossover (PX) conformation to a double-junction QX2) conformation. ] 

Stem Loop Controllers 

A design called a stem loop, consisting of a single strand of DNA which has a loop at an 
end, are a dynamic structure that opens and closes when a piece of DNA bonds to the loop 
part. This effect has been exploited to create several logic gates. [ ] [ ] These logic gates 
have been used to create the computers MAYA I and MAYA II which can play tick-tac-toe to 
some extent. 1 J 



Applications 

Algorithmic self-assembly 

DNA nanotechnology has been applied to 
the related field of DNA computing. A DX 
array has been demonstrated whose 
assembly encodes an XOR operation, which 
allows the DNA array to implement a 
cellular automaton which generates a 
fractal called the Sierpinski gasket. This 
shows that computation can be 
incorporated into the assembly of DNA 
arrays, increasing its scope beyond simple 
periodic arrays. 

Note that DNA computing overlaps with, 

but is distinct from, DNA nanotechnology. 

The latter uses the specificity of 

Watson-Crick basepairing to make novel 

structures out of DNA. These structures can be used for DNA computing, but they do not 

have to be. Additionally, DNA computing can be done without using the types of molecules 

made possible by DNA Nanotechnology. 5] 
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Nanoarchitecture 

The idea of using DNA arrays to 
template the assembly of other 
functional molecules has been 
around for a while, but only 
recently has progress been made 
in reducing these kinds of schemes 
to practice. In 2006, researchers 
covalently attached gold 

nanoparticles to a DX-based tile 
and showed that self-assembly of 
the DNA structures also assembled 
the nanoparticles hosted on them. 
A non-covalent hosting scheme 
was shown in 2007, using Dervan 
polyamides on a DX array to 
arrange streptavidin proteins on 
specific kinds of tiles on the DNA 
array. J Previously in 2006 
LaBean demonstrated the letters 
"D" "N" and "A" created on a 4x4 DX array using streptavidin. [17] 

DNA has also been used to assemble a single walled carbon nanotube Field-effect 
transistor. ] 




DNA arrays that display a representation of the Sierpinski gasket 

on their surfaces. Click the image for further details. Image from 

Rothemund et al, 2004. [14] 



See also 

• Mechanical properties of DNA 



External links 

Chengde Mao page at Purdue University [19] 

John Reif lab at Duke University [20] 

Nadrian Seeman lab at NYU [21] 

William M. Shih lab at Harvard Medical School [22] 

Andrew Turberfield lab at Oxford University [23] 

Erik Winfree lab at Caltech [24] 

Hao Yan lab at Arizona State University [25] 

Bernard Yurke formerly at Bell Labs [26] now at Boise State University [27] 

Thorn LaBean at Duke University [28] 

Software for 3D DNA design, modeling and/or simulation: 

• Ascalaph Designer [29] 

• caDNAno [30] 

• GIDEON [31] 

• NanoEngineer-1 L J 
International Society for Nanoscale Science, Computation and Engineering [33] 



DNA nanotechnology 217 

References 

Note: Click on the doi to access the text of the referenced article. 

[1] Created from PDB 1M6G (http://www.rcsb. org/pdb/explore/explore.do?structureId=lM6G) 
[2] http://dx.doi.org/10.1371/journal.pbio.0020431 

• Seeman, Nadrian C. (1 November 1999). "DNA Engineering and its Application to Nanotechnology". Trends 
in Biotechnology 17 (11): 437-443. doi: 10.1016/S0167-7799(99)01360-8 (http://dx.doi.org/10.1016/ 
S0167-7799(99)01360-8). ISSN 0167-7799 (http://worldcat.org/issn/0167-7799). 

• Seeman, Nadrian C. (January 2001). "DNA Nicks and Nodes and Nanotechnology". Nano Letters 1 (1): 
22-26. doi: 10.1021/nl000182v (http://dx.doi.org/10.1021/nl000182v). ISSN 1530-6984 (http://worldcat. 
org/issn/1530-6984). 

• Mao, Chengde (December 2004). "The Emergence of Complexity: Lessons from DNA". PLoS Biology 2 (12): 
2036-2038. doi: 10. 1371/journal.pbio. 0020431 (http://dx.doi.org/10.1371/journal.pbio.0020431). ISSN 
1544-9173 (http://worldcat.org/issn/1544-9173). 

• Kumara, Mudalige T. O u ly 2008). "Assembly pathway analysis of DNA nanostructures and the construction of 
parallel motifs". Nano Letters 8 (7): 1971-1977. doi: 10.1021/nl800907y (http://dx.doi.org/10.1021/ 
nl800907y). ISSN . 

• Winfree, Erik; Liu, Furong; Wenzler, Lisa A. & Seeman, Nadrian C. (6 August 1998). "Design and 
self-assembly of two-dimensional DNA crystals". Nature 394: 529-544. doi: 10.1038/28998 (http://dx.doi. 
org/10.1038/28998). ISSN 0028-0836 (http://worldcat.org/issn/0028-0836). 

• Liu, Furong; Sha, Ruojie & Seeman, Nadrian C. (10 February 1999). "Modifying the Surface Features of 
Two-Dimensional DNA Crystals". Journal of the American Chemical Society 121 (5): 917-922. doi: 
10.1021/ja982824a (http://dx.doi.org/10.1021/ja982824a). ISSN 0002-7863 (http://worldcat.org/issn/ 
0002-7863). 

• Rothemund, Paul W. K.; Ekani-Nkodo, Axel; Papadakis, Nick; Kumar, Ashish; Fygenson, Deborah Kuchnir & 
Winfree, Erik (22 December 2004). "Design and Characterization of Programmable DNA Nanotubes". Journal 
of the American Chemical Society 126 (50): 16344-16352. doi: 10.1021/ja0443191 (http://dx.doi.org/10. 
1021/ja0443191). ISSN 0002-7863 (http://worldcat.org/issn/0002-7863). 

• Mao, Chengde; Sun, Weiqiong & Seeman, Nadrian C. (16 June 1999). "Designed Two-Dimensional DNA 
Holliday Junction Arrays Visualized by Atomic Force Microscopy". Journal of the American Chemical Society 
121 (23): 5437-5443. doi: 10.1021/ja9900398 (http://dx.doi.org/10.1021/ja9900398). ISSN 0002-7863 
(http://worldcat.org/issn/0002-7863). 

• Constantinou, Pamela E.; Wang, Tong; Kopatsch, Jens; Israel, Lisa B.; Zhang, Xiaoping; Ding, Baoquan; 
Sherman, William B.; Wang, Xing; Zheng, Jianping; Sha, Ruojie & Seeman, Nadrian C. (2006). "Double 
cohesion in structural DNA nanotechnology". Organic and Biomolecular Chemistry 4: 3414-3419. doi: 
10. 1039/b605212f (http://dx.doi.org/10.1039/b605212f). 

• Mathieu, Frederick; Liao, Sniping; Kopatsch, Jens; Wang, Tong; Mao, Chengde & Seeman, Nadrian C. (April 
2005). "Six-Helix Bundles Designed from DNA". Nano Letters 5 (4): 661-665. doi: 10.1021/nl050084f (http:// 
dx.doi.org/10.1021/nl050084f). ISSN 1530-6984 (http://worldcat.org/issn/1530-6984). 

• Rothemund, Paul W. K. (2006). "Folding DNA to create nanoscale shapes and patterns". Nature 440: 
297-302. doi: 10.1038/nature04586 (http://dx.doi.org/10.1038/nature04586). ISSN 0028-0836 (http:// 
worldcat.org/issn/0028-0836). 

• Zhang, Yuwen; Seeman, Nadrian C. (1994). "Construction of a DNA-truncated octahedron". Journal of the 
American Chemical Society 116 (5): 1661-1669. doi: 10.1021/ja00084a006 (http://dx.doi.org/10.1021/ 
ja00084a006). ISSN 0002-7863 (http://worldcat.org/issn/0002-7863). 

• Shih, William M. ; Quispe, Joel D. ; Joyce, Gerald F. (12 February 2004). "A 1.7-kilobase single-stranded DNA 
that folds into a nanoscale octahedron". Nature 427: 618-621. doi: 10.1038/nature02307 (http://dx.doi. 
org/10. 1038/nature02307). ISSN 0028-0836 (http://worldcat.org/issn/0028-0836). 

• Goodman, R.P.; Schaap, I.A.T.; Tardin, C.F.; Erben, CM.; Berry, R.M.; Schmidt, C.F.; Turberfield, A.J. (9 
December 2005). "Rapid chiral assembly of rigid DNA building blocks for molecular nanofabrication". 
Science 310 (5754): 1661-1665. doi: 10.1126/science.ll20367 (http://dx.doi.org/10.1126/science. 
1120367). ISSN 0036-8075 (http://worldcat.org/issn/0036-8075). 

• Yurke, Bernard; Turberfield, Andrew J.; Mills, Allen P., Jr; Simmel, Friedrich C. & Neumann, Jennifer L. (10 
August 2000). "A DNA-fuelled molecular machine made of DNA". Nature 406: 605-609. doi: 
10.1038/35020524 (http://dx.doi.org/10.1038/35020524). ISSN 0028-0836 (http://worldcat.org/issn/ 



DNA nanotechnology 218 

0028-0836). 

• Mao, Chengde; Sun, Weiqiong; Shen, Zhiyong & Seeman, Nadrian C. (14 January 1999). "A DNA 
Nanomechanical Device Based on the B-Z Transition". Nature 397: 144-146. doi: 10.1038/16437 (http://dx. 
doi.org/10. 1038/16437). ISSN . 

• Yan, Hao; Zhang, Xiaoping; Shen, Zhiyong & Seeman, Nadrian C. (3 January 2002). "A robust DNA 
mechanical device controlled by hybridization topology". Nature 415: 62-65. doi: 10.1038/41 5062a (http:// 
dx.doi.org/10.1038/415062a). ISSN . 

[11] DNA Logic Gates (https://digamma.cs.unm.edu/wiki/bin/view/McogPublicWeb/MolecularLogicGates) 
[12] (http://www.duke.edu/~jmel7/Joshua_E._Mendoza-Elias/Research_Ideas.html) 
[13] MAYA II (https://digamma.cs.unm.edu/wiki/bin/view/McogPublicWeb/MolecularAutomataMAYAII) 
[14] http://dx.doi.org/10.1371/journal.pbio.0020424 

• Rothemund, Paul W. K.; Papadakis, Nick & Winfree, Erik (December 2004). "Algorithmic Self-Assembly of 
DNA Sierpinski Triangles". PLoS Biology 2 (12): 2041-2053. doi: 10. 1371/journal.pbio. 0020424 (http://dx. 
doi.org/10.1371/journal.pbio.0020424). ISSN 1544-9173 (http://worldcat.org/issn/1544-9173). 

• Robinson, Bruche H.; Seeman, Nadrian C. (August 1987). "The Design of a Biochip: A Self-Assembling 
Molecular-Scale Memory Device". Protein Engineering 1 (4): 295-300. ISSN 0269-2139 (http://worldcat. 
org/issn/0269-2139). Link (http://peds.oxfordjournals.Org/cgi/content/abstract/l/4/295) 

• Zheng, Jiwen; Constantinou, Pamela E.; Micheel, Christine; Alivisatos, A. Paul; Kiehl, Richard A. & Seeman 
Nadrian C. (2006). "2D Nanoparticle Arrays Show the Organizational Power of Robust DNA Motifs". Nano 
Letters 6: 1502-1504. doi: 10.1021/nl060994c (http://dx.doi.org/10.1021/nl060994c). ISSN 1530-6984 
(http://worldcat.org/issn/1530-6984). 

• Cohen, Justin D.; Sadowski, John P.; Dervan, Peter B. (2007). "Addressing Single Molecules on DNA 
Nanostructures". Ange wandte Chemie 46 (42): 7956-7959. doi: 10.1002/anie.200702767 (http://dx.doi. 
org/10. 1002/anie. 200702767). ISSN 0570-0833 (http://worldcat.org/issn/0570-0833). 

[17] Park, Sung Ha; Sung Ha Park, Constantin Pistol, Sang Jung Ahn, John H. Reif, Alvin R. Lebeck, Chris Dwyer, 
Thomas H. LaBean (October 2006). 

"http://www3.interscience.wiley.com/journal/113390879/abstractlFinite-Size, Fully Addressable DNA Tile 
Lattices Formed by Hierarchical Assembly Procedures". Angewandte Chemie 118 (40): 749-753. doi: 
10. 1002/ange. 200690141 (http://dx.doi.org/10.1002/ange.200690141). ISSN 1521-3757 (http://worldcat. 
org/issn/1 52 1-3757). http://www3.interscience.wiley.com/journal/113390879/abstract. 

[18] Keren, K.; Kinneret Keren, Rotem S. Berman, Evgeny Buchstab, Uri Sivan, Erez Braun (November 2003). 
"http://www.sciencemag.org/cgi/content/abstract/sci;302/5649/1380|DNA-Templated Carbon Nanotube 
Field-Effect Transistor". Science 302 (6549): 1380-1382. doi: 10.1126/science.l091022 (http://dx.doi.org/10. 
1126/science. 1091022). ISSN 1095-9203 (http://worldcat.org/issn/1095-9203). http://www.sciencemag. 
org/cgi/content/abstract/sci;302/5649/1380. 

[19] http://www.chem. purdue.edu/people/faculty/faculty.asp?itemID=46 

[20] http://www.cs.duke.edu/~reif/BMC/Reif.BMCproject.html 

[21] http://seemanlab4.chem.nyu.edu/ 

[22] http://research2.dfci.harvard.edu/shih/SHIH_LAB/Home.html 

[23] http ://www. physics . ox. ac . uk/cm/people/turberfield. htm 

[24] http://dna.caltech.edu/ 

[25] http://chemistry.asu.edu/faculty/hao_yan.asp 

[26] http://www.bell-labs.com/org/physicalsciences/profiles/yurke.html 

[27] http://coen.boisestate.edu/departments/faculty.asp?ID=134 

[28] http://www.cs.duke.edu/~thl/ 

[29] http://www.agilemolecule.com/Ascalaph/Ascalaph_Designer.html 

[30] http://cadnano.org 

[31] http://www.subirac.com 

[ 32 ] http ://www. nanoengineer- 1 . net 

[33] http://www.isnsce.org/ 



Imaging 219 

Imaging 

Imaging is the representation or reproduction of an object's outward form; especially a 
visual representation (i.e., the formation of an image). 

Imaging methodologies and technologies 

• Chemical imaging, the simultaneous measurement of spectra and pictures 

• Creation of a disk image, a file which contains the exact content of a non-volatile 
computer data storage medium. See also disk cloning 

• Digital imaging, creating digital images, generally by scanning, or through digital 
photography 

• Document imaging, replicating documents commonly used in business 

• Geophysical imaging 

• Medical imaging, creating images of the human body or parts of it, to diagnose or 
examine disease 

• Magnetic resonance imaging 

• Molecular imaging 

• Optical imaging 

• Personal imaging, real-time sharing of personal experience through images 

• Radar imaging, or imaging radar, for obtaining an image of an object, not just its location 
and speed 

• Reprography, reproduction of graphics through electrical and mechanical means 

• Cinematography 

• Photography, the process of creating still images 

• Xerography, the method of photocopying 

• Speckle imaging, a method of shift-and-add for astronomical imaging 

• Stereo imaging, an aspect of sound recording and reproduction concerning spatial 
locations of the performers 

• Thermography, infrared imaging 

Proper names 

• Imaging for Windows, a software product for scanning paper documents 

See also 

• Image processing 

• Imaging technology 

• Imaging science, which includes many fields of science 

• Remote sensing, imaging the Earth or a planet from space or aircraft 
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Atomic force microscope 



The atomic force microscope (AFM) or 
scanning force microscope (SFM) is a very 
high-resolution type of scanning probe 
microscopy, with demonstrated resolution 
of fractions of a nanometer, more than 
1000 times better than the optical 
diffraction limit. The precursor to the AFM, 
the scanning tunneling microscope, was 
developed by Gerd Binnig and Heinrich 
Rohrer in the early 1980s, a development 
that earned them the Nobel Prize for 
Physics in 1986. Binnig, Quate and Gerber 
invented the first AFM in 1986. The AFM is 
one of the foremost tools for imaging, 
measuring and manipulating matter at the 
nanoscale. The information is gathered by 
"feeling" the surface with a mechanical 
probe. Piezoelectric elements that facilitate 
tiny but accurate and precise movements 
on (electronic) command enable the very 
precise scanning. 
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AFM cantilever (after use) in the 

Scanning Electron Microscope, 

magnification 1,000 x (image width ~ 

100 micrometers) 



The AFM consists of a microscale cantilever with a 
sharp tip (probe) at its end that is used to scan the 
specimen surface. The cantilever is typically silicon or 
silicon nitride with a tip radius of curvature on the 
order of nanometers. When the tip is brought into 
proximity of a sample surface, forces between the tip 
and the sample lead to a deflection of the cantilever 
according to Hooke's law. Depending on the situation, 
forces that are measured in AFM include mechanical 
contact force, Van der Waals forces, capillary forces, 
chemical bonding, electrostatic forces, magnetic forces 
(see Magnetic force microscope (MFM)), Casimir 
forces, solvation forces etc. As well as force, additional 
quantities may simultaneously be measured through the 
use of specialised types of probe (see Scanning thermal 
microscopy, photothermal microspectroscopy, etc.). 
Typically, the deflection is measured using a laser spot 
reflected from the top surface of the cantilever into an 
array of photodiodes. Other methods that are used 
include optical interferometry, capacitive sensing or 
piezoresistive AFM cantilevers. These cantilevers are 
fabricated with piezoresistive elements that act as a 
strain gauge. Using a Wheatstone bridge, strain in the 
AFM cantilever due to deflection can be measured, but 
this method is not as sensitive as laser deflection or 
interferometry. 



damage. Hence, in most cases 
tip-to-sample distance to maintain 



If the tip was scanned at a constant height, a risk would 
exist that the tip collides with the surface, causing 
a feedback mechanism is employed to adjust the 
a constant force between the tip and the 
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sample. Traditionally, the sample is mounted on a 

piezoelectric tube, that can move the sample in the z 

direction for maintaining a constant force, and the x 

and y directions for scanning the sample. Alternatively 

a 'tripod' configuration of three piezo crystals may be 

employed, with each responsible for scanning in the x,y 

and z directions. This eliminates some of the distortion 

effects seen with a tube scanner. In newer designs, the 

tip is mounted on a vertical piezo scanner while the 

afm cantilever (after use) in the sample is being scanned in X and Y using another piezo 

Scanning Electron Microscope, block. The resulting map of the area s = f(x 7 y) 

magnification 3,000 x (image width - represents the topography of the sample. 

30 micrometers) 




The AFM can be operated in a number of modes, depending on the application. In general, 
possible imaging modes are divided into static (also called Contact) modes and a variety of 
dynamic (or non-contact) modes where the cantilever is vibrated. 

Imaging modes 

The primary modes of operation are static (contact) mode and dynamic mode. In the static 
mode operation, the static tip deflection is used as a feedback signal. Because the 
measurement of a static signal is prone to noise and drift, low stiffness cantilevers are used 
to boost the deflection signal. However, close to the surface of the sample, attractive forces 
can be quite strong, causing the tip to 'snap-in' to the surface. Thus static mode AFM is 
almost always done in contact where the overall force is repulsive. Consequently, this 
technique is typically called 'contact mode'. In contact mode, the force between the tip and 
the surface is kept constant during scanning by maintaining a constant deflection. 

In the dynamic mode, the cantilever is externally oscillated at or close to its fundamental 
resonance frequency or a harmonic. The oscillation amplitude, phase and resonance 
frequency are modified by tip-sample interaction forces; these changes in oscillation with 
respect to the external reference oscillation provide information about the sample's 
characteristics. Schemes for dynamic mode operation include frequency modulation and the 
more common amplitude modulation. In frequency modulation, changes in the oscillation 
frequency provide information about tip-sample interactions. Frequency can be measured 
with very high sensitivity and thus the frequency modulation mode allows for the use of 
very stiff cantilevers. Stiff cantilevers provide stability very close to the surface and, as a 
result, this technique was the first AFM technique to provide true atomic resolution in 
ultra-high vacuum conditions (Giessibl). 

In amplitude modulation, changes in the oscillation amplitude or phase provide the 
feedback signal for imaging. In amplitude modulation, changes in the phase of oscillation 
can be used to discriminate between different types of materials on the surface. Amplitude 
modulation can be operated either in the non-contact or in the intermittent contact regime. 
In ambient conditions, most samples develop a liquid meniscus layer. Because of this, 
keeping the probe tip close enough to the sample for short-range forces to become 
detectable while preventing the tip from sticking to the surface presents a major hurdle for 
the non-contact dynamic mode in ambient conditions. Dynamic contact mode (also called 
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intermittent contact or tapping mode) was developed to bypass this problem (Zhong et al.). 
In dynamic contact mode, the cantilever is oscillated such that the separation distance 
between the cantilever tip and the sample surface is modulated. 

Amplitude modulation has also been used in the non-contact regime to image with atomic 
resolution by using very stiff cantilevers and small amplitudes in an ultra-high vacuum 
environment. 



Tapping Mode 

In tapping mode the cantilever is driven to oscillate 

up and down at near its resonance frequency by a 

small piezoelectric element mounted in the AFM tip 

holder. The amplitude of this oscillation is greater 

than 10 nm, typically 100 to 200 nm. Due to the 

interaction of forces acting on the cantilever when 

the tip comes close to the surface, Van der Waals 

force or dipole-dipole interaction, electrostatic 

forces, etc cause the amplitude of this oscillation to 

decrease as the tip gets closer to the sample. An 

electronic servo uses the piezoelectric actuator to 

control the height of the cantilever above the 

sample. The servo adjusts the height to maintain a 

set cantilever oscillation amplitude as the cantilever 

is scanned over the sample. A Tapping AFM image 

is therefore produced by imaging the force of the 

oscillating contacts of the tip with the sample 

surface. This is an improvement on conventional 

contact AFM, in which the cantilever just drags 

across the surface at constant force and can result in surface damage. Tapping mode is 

gentle enough even for the visualization of supported lipid bilayers or adsorbed single 

polymer molecules (for instance, 0.4 nm thick chains of synthetic polyelectrolytes) under 

liquid medium. At the application of proper scanning parameters, the conformation of 

single molecules remains unchanged for hours (Roiter and Minko, 2005). 




Single polymer chains (0.4 nm thick) 
recorded in a tapping mode under aqueous 
media with different pH. Green locations of 
the two-chains-superposition correspond to 
0.8 nm thickness (Roiter and Minko, 2005). 



Non-Contact Mode 

Here the tip of the cantilever does not contact the sample surface. The cantilever is instead 
oscillated at a frequency slightly above its resonance frequency where the amplitude of 
oscillation is typically a few nanometers (<10nm). The van der Waals forces, which are 
strongest from lnm to lOnm above the surface, or any other long range force which 
extends above the surface acts to decrease the resonance frequency of the cantilever. This 
decrease in resonance frequency combined with the feedback loop system maintains a 
constant oscillation amplitude or frequency by adjusting the average tip-to-sample distance. 
Measuring the tip-to-sample distance at each (x,y) data point allows the scanning software 
to construct a topographic image of the sample surface. 

Non-contact mode AFM does not suffer from tip or sample degradation effects that are 
sometimes observed after taking numerous scans with contact AFM. This makes 
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non-contact AFM preferable to contact AFM for measuring soft samples. In the case of rigid 
samples, contact and non-contact images may look the same. However, if a few monolayers 
of adsorbed fluid are lying on the surface of a rigid sample, the images may look quite 
different. An AFM operating in contact mode will penetrate the liquid layer to image the 
underlying surface, whereas in non-contact mode an AFM will oscillates above the adsorbed 
fluid layer to image both the liquid and surface. 



AFM -Beam Deflection Detection 







Cantilever ft Up 



Laser light from a solid state diode is reflected off the 

back of the cantilever and collected by a position 

sensitive detector (PSD) consisting of two closely 

spaced photodiodes whose output signal is collected by 

a differential amplifier. Angular displacement of 

cantilever results in one photodiode collecting more 

light than the other photodiode, producing an output 

signal (the difference between the photodiode signals 

normalized by their sum) which is proportional to the 

deflection of the cantilever. It detects cantilever 

deflections <lA (thermal noise limited). A long beam path (several cm) amplifies changes in 

beam angle. 



AFM - Non-Contact Mode 
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AFM Beam Deflection Detection 



Force spectroscopy 

Another major application of AFM (besides imaging) is 

force spectroscopy, the measurement of force-distance 

curves. For this method, the AFM tip is extended 

towards and retracted from the surface as the static 

deflection of the cantilever is monitored as a function of 

piezoelectric displacement. These measurements have 

been used to measure nanoscale contacts, atomic 

bonding, Van der Waals forces, and Casimir forces, 

dissolution forces in liquids and single molecule stretching and rupture forces (Hinterdorfer 

& Dufrene). Forces of the order of a few pico-Newton can now be routinely measured with a 

vertical distance resolution of better than 0.1 nanometer. 

Problems with the technique include no direct measurement of the tip-sample separation 
and the common need for low stiffness cantilevers which tend to 'snap' to the surface. The 
snap-in can be reduced by measuring in liquids or by using stiffer cantilevers, but in the 
latter case a more sensitive deflection sensor is needed. By applying a small dither to the 
tip, the stiffness (force gradient) of the bond can be measured as well (Hoffmann et al.). 
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Identification of individual surface atoms 

The AFM can be used to image and manipulate atoms 
and structures on a variety of surfaces. The atom at the 
apex of the tip "senses" individual atoms on the 
underlying surface when it forms incipient chemical 
bonds with each atom. Because these chemical 
interactions subtly alter the tip's vibration frequency, 
they can be detected and mapped. 

Physicist Oscar Custance (Osaka University, Graduate 
School of Engineering, Osaka, Japan) and his team used 
this principle to distinguish between atoms of silicon, 
tin and lead on an alloy surface (Nature 2007, 446, 64). 

The trick is to first measure these forces precisely for 
each type of atom expected in the sample. The team 
found that the tip interacted most strongly with silicon 
atoms, and interacted 23% and 41% less strongly with 
tin and lead atoms, respectively. Thus, each different 
type of atom can be identified in the matrix as the tip is moved across the surface. 

Such a technique has been used now in biology and extended recently to cell biology. 
Forces corresponding to (i) the unbinding of receptor ligand couples (ii) unfolding of 
proteins (iii) cell adhesion at single cell scale have been gathered. 




The atoms of a Sodium Chloride crystal 

viewed with an Atomic Force 

Microscope 



Advantages and disadvantages 




The AFM has several advantages over the scanning 

electron microscope (SEM). Unlike the electron 

microscope which provides a two-dimensional 

projection or a two-dimensional image of a sample, the 

AFM provides a true three-dimensional surface profile. 

Additionally, samples viewed by AFM do not require 

any special treatments (such as metal/carbon coatings) 

that would irreversibly change or damage the sample. 

While an electron microscope needs an expensive 

vacuum environment for proper operation, most AFM 

modes can work perfectly well in ambient air or even a 

liquid environment. This makes it possible to study 

biological macromolecules and even living organisms. In principle, AFM can provide higher 

resolution than SEM. It has been shown to give true atomic resolution in ultra-high vacuum 

(UHV) and, more recently, in liquid environments. High resolution AFM is comparable in 

resolution to Scanning Tunneling Microscopy and Transmission Electron Microscopy. 

A disadvantage of AFM compared with the scanning electron microscope (SEM) is the 
image size. The SEM can image an area on the order of millimetres by millimetres with a 
depth of field on the order of millimetres. The AFM can only image a maximum height on 
the order of micrometres and a maximum scanning area of around 150 by 150 micrometres. 



The first Atomic Force Microscope 
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Another inconvenience is that an incorrect choice of tip for the required resolution can lead 
to image artifacts. Traditionally the AFM could not scan images as fast as an SEM, 
requiring several minutes for a typical scan, while a SEM is capable of scanning at near 
real-time (although at relatively low quality) after the chamber is evacuated. The relatively 
slow rate of scanning during AFM imaging often leads to thermal drift in the image 
(Lapshin, 2004, 2007), making the AFM microscope less suited for measuring accurate 
distances between artifacts on the image. However, several fast-acting designs were 
suggested to increase microscope scanning productivity (Lapshin and Obyedkov, 1993) 
including what is being termed videoAFM (reasonable quality images are being obtained 
with videoAFM at video rate - faster than the average SEM). To eliminate image distortions 
induced by thermodrift, several methods were also proposed (Lapshin, 2004, 2007). 

AFM images can also be affected by hysteresis of the piezoelectric material (Lapshin, 1995) 
and cross-talk between the (x,y,z) axes that may require software enhancement and 
filtering. Such filtering could "flatten" out real topographical features. However, newer 
AFM use real-time correction software (for example, feature-oriented scanning, Lapshin, 
2004, 2007) or closed-loop scanners which practically eliminate these problems. Some AFM 
also use separated orthogonal scanners (as opposed to a single tube) which also serve to 
eliminate cross-talk problems. 

Due to the nature of AFM probes, they cannot normally measure steep walls or overhangs. 
Specially made cantilevers can be modulated sideways as well as up and down (as with 
dynamic contact and non-contact modes) to measure sidewalls, at the cost of more 
expensive cantilevers and additional artifacts. 

Piezoelectric Scanners 

AFM scanners are made from piezoelectric material, which expands and contracts 
proportionally to an applied voltage. Whether they elongate or contract depends upon the 
polarity of the voltage applied. The scanner is constructed by combining independently 
operated piezo electrodes for X, Y, & Z into a single tube, forming a scanner which can 
manipulate samples and probes with extreme precision in 3 dimensions. 

Scanners are characterized by their sensitivity which is 
the ratio of piezo movement to piezo voltage, i.e. by 
how much the piezo material extends or contracts per 
applied volt. Because of differences in material or size, 
the sensitivity varies from scanner to scanner. 

Sensitivity varies non-linearly with respect to scan size. 
Piezo scanners exhibit more sensitivity at the end than 
at the beginning of a scan. This causes the forward and ^^^^^^^^^^^S^^TL^ 

"-* "-* raster motion m X ana Y Then* are rtvo waiifflK of uie pieroe Iff bit cryitol for X (X & X) and Y 

reverse scans to behave differently and display 

hysteresis between the two scan directions. This can be iezoe e( 11( (cllinei 

corrected by applying a non-linear voltage to the piezo 

electrodes to cause linear scanner movement and calibrating the scanner accordingly. 

The sensitivity of piezoelectric materials decreases exponentially with time. This causes 
most of the change in sensitivity to occur in the initial stages of the scanner's life. 
Piezoelectric scanners are run for approximately 48 hours before they are shipped from the 
factory so that they are past the point where we can expect large changes in sensitivity. As 
the scanner ages, the sensitivity will change less with time and the scanner would seldom 



Typical scantier piezo tube andX-Y-Z confiBuratoons. AC Sijaiah applied ro conductive areas of 
the tube create piezo DB\emenl^lona the three major axes 
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require recalibration. 

See also 

• Interfacial force microscope 

• Friction force microscope 

• Scanning tunneling microscope 

• Scanning probe microscopy 

• Scanning voltage microscopy 
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An electron microscope is a type of microscope that 
uses a particle beam of electrons to illuminate a 
specimen and create a highly-magnified image. 
Electron microscopes have much greater resolving 
power than light microscopes that use 
electromagnetic radiation and can obtain much higher 
magnifications of up to 2 million times, while the best 
light microscopes are limited to magnifications of 
2000 times. Both electron and light microscopes have 
resolution limitations, imposed by the wavelength of 
the radiation they use. The greater resolution and 
magnification of the electron microscope is because 
the wavelength of an electron, its de Broglie 
wavelength, is much smaller than that of a photon of 
visible light. 

The electron microscope uses electrostatic and electromagnetic lenses in forming the 
image by controlling the electron beam to focus it at a specific plane relative to the 
specimen in a manner similar to how a light microscope uses glass lenses to focus light on 
or through a specimen to form an image. 



Transmission Electron Microscope 

Diagram of a transmission electron 
microscope 
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History 

The first electron microscope prototype was built in 
1931 by the German engineers Ernst Ruska and Max 
Knoll. J Although this initial instrument was capable of 
magnifying objects by only four hundred times, it 
demonstrated the principles of an electron microscope. 
Two years later, Ruska constructed an electron 
microscope that exceeded the resolution possible with 
an optical microscope. 



[l] 



Reinhold Rudenberg, the scientific director of Siemens, 
had patented the electron microscope in 1931, 
stimulated by family illness to make the poliomyelitis 
virus particle visible. In 1937 Siemens began funding 
Ruska and Bodo von Borries to develop an electron 
microscope. Siemens also employed Ruska's brother 
Helmut to work on applications, particularly with 
biological specimens. ] [ ] 




Electron microscope constructed by 
Ernst Ruska in 1933 



In the same decade Manfred von Ardenne pioneered 
the scanning electron microscope and his universal 
electron microscope. J 

Siemens produced the first commercial Transmision 

Electron Microscope (TEM) in 1939, but the first 

practical electron microscope had been built at the University of Toronto in 1938, by Eli 

Franklin Burton and students Cecil Hall, James Hillier, and Albert Prebus. J 

Although modern electron microscopes can magnify objects up to two million times, they 
are still based upon Ruska's prototype. The electron microscope is an essential item of 
equipment in many laboratories. Researchers use them to examine biological materials 
(such as microorganisms and cells), a variety of large molecules, medical biopsy samples, 
metals and crystalline structures and the characteristics of various surfaces. The electron 
microscope is also used extensively for inspection, quality assurance and failure analysis 
applications in industry, including, in particular, semiconductor device fabrication. 



Types 

Transmission Electron Microscope (TEM) 

The original form of electron microscope, the transmission electron microscope (TEM) uses 
a high voltage electron beam to create an image. The electrons are emitted by an electron 
gun, commonly fitted with a tungsten filament cathode as the electron source. The electron 
beam is accelerated by an anode typically at +100keV (40 to 400 keV) with respect to the 
cathode, focused by electrostatic and electromagnetic lenses, and transmitted through the 
specimen that is in part transparent to electrons and in part scatters them out of the beam. 
When it emerges from the specimen, the electron beam carries information about the 
structure of the specimen that is magnified by the objective lens system of the microscope. 
The spatial variation in this information (the "image") is viewed by projecting the magnified 
electron image onto a fluorescent viewing screen coated with a phosphor or scintillator 
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material such as zinc sulfide. The image can be photographically recorded by exposing a 
photographic film or plate directly to the electron beam, or a high-resolution phosphor may 
be coupled by means of a lens optical system or a fibre optic light-guide to the sensor of a 
CCD (charge-coupled device) camera. The image detected by the CCD may be displayed on 
a monitor or computer. 

Resolution of the TEM is limited primarily by spherical aberration, but a new generation of 
aberration correctors have been able to partially overcome spherical aberration to increase 
resolution. Software correction of spherical aberration for the High Resolution TEM 
(HRTEM) has allowed the production of images with sufficient resolution to show carbon 
atoms in diamond separated by only 0.89 angstrom (89 picometers) and atoms in silicon at 
0.78 angstrom (78 picometers)^ J L J at magnifications of 50 million times. J The ability to 
determine the positions of atoms within materials has made the HRTEM an important tool 
for nano-technologies research and development. J 

Scanning Electron Microscope (SEM) 

Unlike the TEM, where electrons of the high voltage 

beam carry the image of the specimen, the electron 

beam of the Scanning Electron Microscope (SEM) 1 - J 

does not at any time carry a complete image of the 

specimen. The SEM produces images by probing the 

specimen with a focused electron beam that is scanned 

across a rectangular area of the specimen (raster 

scanning). At each point on the specimen the incident 

electron beam loses some energy, and that lost energy 

is converted into other forms, such as heat, emission of 

low-energy secondary electrons, light emission B 

(cathodoluminescence) or x-ray emission. The display of An image of an ant from a scanning 

electron microscope 
the SEM maps the varying intensity of any of these 

signals into the image in a position corresponding to 

the position of the beam on the specimen when the signal was generated. In the SEM image 

of an ant shown at right, the image was constructed from signals produced by a secondary 

electron detector, the normal or conventional imaging mode in most SEMs. 

Generally, the image resolution of an SEM is about an order of magnitude poorer than that 
of a TEM. However, because the SEM image relies on surface processes rather than 
transmission it is able to image bulk samples up to several centimetres in size (depending 
on instrument design) and has a much greater depth of view, and so can produce images 
that are a good representation of the 3D structure of the sample. 

Reflection Electron Microscope (REM) 

In the Reflection Electron Microscope (REM) as in the TEM, an electron beam is 
incident on a surface, but instead of using the transmission (TEM) or secondary electrons 
(SEM), the reflected beam of elastically scattered electrons is detected. This technique is 
typically coupled with Reflection High Energy Electron Diffraction (RHEED) and Reflection 
high-energy loss spectrum (RHELS). Another variation is Spin-Polarized Low-Energy 

Electron Microscopy (SPLEEM), which is used for looking at the microstructure of magnetic 

nil 
domains. J 
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Scanning Transmission Electron Microscope (STEM) 

The STEM rasters a focused incident probe across a specimen that (as with the TEM) has 
been thinned to facilitate detection of electrons scattered through the specimen. The high 
resolution of the TEM is thus possible in STEM. The focusing action (and aberrations) occur 
before the electrons hit the specimen in the STEM, but afterward in the TEM. The STEM'S 
use of SEM-like beam rastering simplifies annular dark-field imaging, and other analytical 
techniques, but also means that image data is acquired in serial rather than in parallel 
fashion. 




An insect coated in gold for viewing 
with a scanning electron microscope. 



Sample preparation 

Materials to be viewed under an electron microscope 
may require processing to produce a suitable sample. 
The technique required varies depending on the 
specimen and the analysis required: 

• Chemical Fixation for biological specimens aims to 
stabilize the specimen's mobile macromolecular 
structure by chemical crosslinking of proteins with 
aldehydes such as formaldehyde and glutaraldehyde, 
and lipids with osmium tetroxide. 

• Cryofixation - freezing a specimen so rapidly, to 
liquid nitrogen or even liquid helium temperatures, 
that the water forms vitreous (non-crystalline) ice. 
This preserves the specimen in a snapshot of its 
solution state. An entire field called cryo-electron 
microscopy has branched from this technique. With 
the development of cryo-electron microscopy of 

vitreous sections (CEMOVIS), it is now possible to observe samples from virtually any 
biological specimen close to its native state. 

• Dehydration - freeze drying, or replacement of water with organic solvents such as 
ethanol or acetone, followed by critical point drying or infiltration with embedding resins. 

• Embedding, biological specimens - after dehydration, tissue for observation in the 
transmission electron microscope is embedded so it can be sectioned ready for viewing. 
To do this the tissue is passed through a 'transition solvent' such as epoxy propane and 
then infiltrated with a resin such as Araldite epoxy resin; tissues may also be embedded 
directly in water-miscible acrylic resin. After the resin has been polymerised (hardened) 
the sample is thin sectioned (ultrathin sections) and stained - it is then ready for viewing. 

• Embedding, materials - after embedding in resin, the specimen is usually ground and 
polished to a mirror-like finish using ultra-fine abrasives. The polishing process must be 
performed carefully to minimize scratches and other polishing artifacts that reduce 
image quality. 

• Sectioning - produces thin slices of specimen, semitransparent to electrons. These can be 
cut on an ultramicrotome with a diamond knife to produce ultrathin slices about 60-90nm 
thick. Disposable glass knives are also used because they can be made in the lab and are 
much cheaper. 

• Staining - uses heavy metals such as lead, uranium or tungsten to scatter imaging 
electrons and thus give contrast between different structures, since many (especially 
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biological) materials are nearly "transparent" to electrons (weak phase objects). In 
biology, specimens are can be stained "en bloc" before embedding and also later after 
sectioning. Typically thin sections are stained for several minutes with an aqueous or 
acoholic solution of uranyl acetate followed by aqueous lead citrate. 

• Freeze-fracture or freeze-etch - a preparation method particularly useful for examining 
lipid membranes and their incorporated proteins in "face on" view. The fresh tissue or 
cell suspension is frozen rapidly (cryofixed), then fractured by simply breaking or by 
using a microtome while maintained at liquid nitrogen temperature. The cold fractured 
surface (sometimes "etched" by increasing the temperature to about -100°C for several 
minutes to let some ice sublime) is then shadowed with evaporated platinum or gold at 
an average angle of 45° in a high vacuum evaporator. A second coat of carbon, 
evaporated perpendicular to the average surface plane is often performed to improve 
stability of the replica coating. The specimen is returned to room temperature and 
pressure, then the extremely fragile "pre-shadowed" metal replica of the fracture surface 
is released from the underlying biological material by careful chemical digestion with 
acids, hypochlorite solution or SDS detergent. The still-floating replica is thoroughly 
washed from residual chemicals, carefully fished up on EM grids, dried then viewed in 
the TEM. 

• Ion Beam Milling - thins samples until they are transparent to electrons by firing ions 
(typically argon) at the surface from an angle and sputtering material from the surface. A 
subclass of this is Focused ion beam milling, where gallium ions are used to produce an 
electron transparent membrane in a specific region of the sample, for example through a 
device within a microprocessor. Ion beam milling may also be used for cross-section 
polishing prior to SEM analysis of materials that are difficult to prepare using mechanical 
polishing. 

• Conductive Coating - An ultrathin coating of electrically-conducting material, deposited 
either by high vacuum evaporation or by low vacuum sputter coating of the sample. This 
is done to prevent the accumulation of static electric fields at the specimen due to the 
electron irradiation required during imaging. Such coatings include gold, gold/palladium, 
platinum, tungsten, graphite etc. and are especially important for the study of specimens 
with the scanning electron microscope. Another reason for coating, even when there is 
more than enough conductivity, is to improve contrast, a situation more common with the 
operation of a FESEM (field emission SEM). When an osmium coater is used, a layer far 
thinner than would be possible with any of the previously mentioned sputtered coatings 
is possible. 12] 
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Disadvantages 

Electron microscopes are expensive to build and 
maintain, but the capital and running costs of confocal 
light microscope systems now overlaps with those of 
basic electron microscopes. They are dynamic rather 
than static in their operation, requiring extremely 
stable high-voltage supplies, extremely stable currents 
to each electromagnetic coil/lens, continuously-pumped 
high- or ultra-high-vacuum systems, and a cooling 
water supply circulation through the lenses and pumps. 
As they are very sensitive to vibration and external 
magnetic fields, microscopes designed to achieve high 
resolutions must be housed in stable buildings 
(sometimes underground) with special services such as 
magnetic field cancelling systems. Some desktop low 
voltage electron microscopes have TEM capabilities at 
very low voltages (around 5 kV) without stringent 
voltage supply, lens coil current, cooling water or 
vibration isolation requirements and as such are much 
less expensive to buy and far easier to install and 
maintain, but do not have the same ultra-high (atomic 
scale) resolution capabilities as the larger instruments. 




Pseudocolored SEM image of the 

feeding basket of Antarctic krill. Real 

electron microscope images do not 

carry any color information; they are 

greyscale. The first degree filter setae 

carry in v-form two rows of second 

degree setae, pointing towards the 

inside of the feeding basket. The 

purple ball is one micrometer in 

diameter. To display the total area of 

this structure one would have to tile 

this image 7500 times. 



The samples largely have to be viewed in vacuum, as 

the molecules that make up air would scatter the electrons. One exception is the 
environmental scanning electron microscope, which allows hydrated samples to be viewed 
in a low-pressure (up to 20 Torr/2.7 kPa), wet environment. 

Scanning electron microscopes usually image conductive or semi-conductive materials best. 
Non-conductive materials can be imaged by an environmental scanning electron 
microscope. A common preparation technique is to coat the sample with a 
several-nanometer layer of conductive material, such as gold, from a sputtering machine; 
however, this process has the potential to disturb delicate samples. 

Small, stable specimens such as carbon nanotubes, diatom frustules and small mineral 
crystals (asbestos fibres, for example) require no special treatment before being examined 
in the electron microscope. Samples of hydrated materials, including almost all biological 
specimens have to be prepared in various ways to stabilize them, reduce their thickness 
(ultrathin sectioning) and increase their electron optical contrast (staining). These 
processes may result in artifacts, but these can usually be identified by comparing the 
results obtained by using radically different specimen preparation methods. It is generally 
believed by scientists working in the field that as results from various preparation 
techniques have been compared and that there is no reason that they should all produce 
similar artifacts, it is reasonable to believe that electron microscopy features correspond 
with those of living cells. In addition, higher-resolution work has been directly compared to 
results from X-ray crystallography, providing independent confirmation of the validity of 
this technique. Since the 1980s, analysis of cryofixed, vitrified specimens has also become 
increasingly used by scientists, further confirming the validity of this technique. 13] [14] [15] 
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Applications 



Semiconductor and data storage 

• Circuit edit 

• Defect analysis 

• Failure analysis 

Biology and life sciences 

Diagnostic electron microscopy 

Cryobiology 

Protein localization 

Electron tomography 

Cellular tomography 

Cryo-electron microscopy 

Toxicology 

Biological production and viral load monitoring 

Particle analysis 

Pharmaceutical QC 
Structural biology 
3D tissue imaging 
Virology 
Vitrification 



Research 

• Electron beam induced depostion 

• Materials qualification 

• Materials and sample preparation 

• Nanoprototyping 

• Nanometrology 

• Device testing and characterization 

Industry 

High-resolution imaging 
2D & 3D micro-characterization 
Macro sample to nanometer metrology 
Particle detection and characterization 
Direct beam-writing fabrication 
Dynamic materials experiments 

Sample preparation 

Forensics 

Mining (mineral liberation analysis) 

Chemical/Petrochemical 



See also 

• Category: Electron microscope images 

• Field emission microscope 

• Scanning tunneling microscope 
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External links 

• Cell Centered Database - Electron microscopy data (http://ccdb.ucsd.edu/sand/ 
main?typeid=4&event=showMPByType&start=l) 

General 

• Nanohedron.com | Nano image gallery (http://www.nanohedron.com/) beautiful 
images generated with electron microscopes. 

• electron microscopy (http://www.microscopy.ethz.ch) Website of the ETH Zurich: Very 
good graphics and images, which illustrate various procedures. 

• Environmental Scanning Electron Microscope (ESEM) (http://www.danilatos.com) 

• X-ray element analysis in electron microscope (http://www.microanalyst.net/index_e. 
phtml) - Information portal with X-ray microanalysis and EDX contents 

History 

• John H.L. Watson: Very early Electron Microscopy in the Department of Physics, the 
University of Toronto - A personal recollection (http://www.physics.utoronto.ca/ 
overview/history/microsco) 

• Rubin Borasky Electron Microscopy Collection, 1930-1988 (http://americanhistory.si. 
edu/archives/d8452.htm) Archives Center, National Museum of American History, 
Smithsonian Institution. 

Other 

• The Royal Microscopical Society, Electron Microscopy Section (UK) (http://www.rms. 
org.uk/em.shtml) 

• Albert Lleal microphotography. Scanning Electron Microphotography Coloured SEM 
(http://www.albertlleal.com/microphotography.html) 
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X-ray microscope 



An X-ray microscope uses electromagnetic radiation in the soft X-ray band to produce 
images of very small objects. 

Unlike visible light, X-rays do not reflect or refract easily, and they are invisible to the 
human eye. Therefore the basic process of an X-ray microscope is to expose film or use a 
charge-coupled device (CCD) detector to detect X-rays that pass through the specimen. It is 
a contrast imaging technology using the difference in absorption of soft x-ray in the water 
window region (wavelength region: 2.3 - 4.4 nm, photon energy region: 0.28 - 0.53 keV) by 
the carbon atom (main element composing the living cell) and the oxygen atom (main 
element for water). 

Early X-ray microscopes by Paul Kirkpatrick and Albert Baez used grazing-incidence 
reflective optics to focus the X-rays, which grazed X-rays off parabolic curved mirrors at a 
very high angle of incidence. An alternative method of focusing X-rays is to use a tiny 
fresnel zone plate of concentric gold or nickel rings on a silicon dioxide substrate. Sir 
Lawrence Bragg produced some of the first usable X-ray images with his apparatus in the 
late 1940's. 

In the 1950's Newberry produced a shadow X-ray 
microscope which placed the specimen between the 
source and a target plate, this became the basis for 
the first commercial X-ray microscopes from the 
General Electric Company. 

The Advanced Light Source (ALS)[1] in Berkeley CA 
is home to XM-1 (http://www.cxro.lbl.gov/BL612/ 
), a full field soft X-ray microscope operated by the 
Center for X-ray Optics [2] and dedicated to various 
applications in modern nanoscience, such as 
nanomagnetic materials, environmental and 
materials sciences and biology. XM-1 uses an X-ray 
lens to focus X-rays on a CCD, in a manner similar 
to an optical microscope. XM-1 still holds the world 
record in spatial resolution with Fresnel zone plates 
down to 15nm and is able to combine high spatial 
resolution with a sub- 10 Ops time resolution to study 
e.g. ultrafast spin dynamics. 

The ALS is also home to the world's first soft x-ray microscope designed for biological and 
biomedical research. This new instrument, XM-2 was designed and built by scientists from 
the National Center for X-ray Tomography (http:/ / next. lbl. gov). XM-2 is capable of 
producing 3-Dimensional tomograms of cells. 

Sources of soft X-rays suitable for microscopy, such as synchrotron radiation sources, have 
fairly low brightness of the required wavelengths, so an alternative method of image 
formation is scanning transmission soft X-ray microscopy. Here the X-rays are focused to a 
point and the sample is mechanically scanned through the produced focal spot. At each 
point the transmitted X-rays are recorded with a detector such as a proportional counter or 
an avalanche photodiode. This type of Scanning Transmission X-ray Microscope (STXM) 




Indirect drive laser inertial confinement 

fusion uses a "hohlraum" which is 

irradiated with laser beam cones from 

either side on it its inner surface to bathe a 

fusion microcapsule inside with smooth 

high intensity X-rays. The highest energy 

X-rays which penetrate the hohlraum can 

be visualized using an X-ray microscope 

such as here, where X-radiation is 

represented in orange/red. 
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was first developed by researchers at Stony Brook University and was employed at the 
National Synchrotron Light Source at Brookhaven National Laboratory. 

The resolution of X-ray microscopy lies between that of the optical microscope and the 
electron microscope. It has an advantage over conventional electron microscopy in that it 
can view biological samples in their natural state. Electron microscopy is widely used to 
obtain images with nanometer level resolution but the relatively thick living cell cannot be 
observed as the sample has to be chemically fixed, dehydrated, embedded in resin, then 
sliced ultra thin. However, it should be mentioned that cryo-electron microscopy allows the 
observation of biological specimens in their hydrated natural state. Until now, resolutions 
of 30 nanometer are possible using the Fresnel zone plate lens which forms the image using 
the soft x-rays emitted from a synchrotron. Recently, more researchers have begun to use 
the soft x-rays emitted from laser-produced plasma rather than synchrotron radiation. 

Additionally, X-rays cause fluorescence in most materials, and these emissions can be 
analyzed to determine the chemical elements of an imaged object. Another use is to 
generate diffraction patterns, a process used in X-ray crystallography. By analyzing the 
internal reflections of a diffraction pattern (usually with a computer program), the 
three-dimensional structure of a crystal can be determined down to the placement of 
individual atoms within its molecules. X-ray microscopes are sometimes used for these 
analyses because the samples are too small to be analyzed in any other way. 



See also 

• Synchrotron X-ray tomographic 
microscopy 

External links 

• Application of X-ray microscopy in 
analysis of living hydrated cells [18] 

• Hard X-ray microbeam experiments with 
a sputtered-sliced Fresnel zone plate and 
its applications [ ] 

• Scientific applications of soft x-ray 

[4] 

microscopy L J 

• Microarrays products [5] 




A square beryllium foil mounted in a steel case to be 

used as a window between a vacuum chamber and an 

X-ray microscope. Beryllium, due to its low Z number 

is highly transparent to X-rays. 
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